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Release 3.1A John F. Collins, Biocomputing Research Unit. 
Copyright (c) 1993-1998 University of Edinburgh, 0,K, 
Distribution rights by Oxford Molecular Ltd 

MPsrchjp protein • protein database search, using Smith-Waterman algorithm 

Run on; Pri Jan 22 18:,C 

Tabular output not generated. 



3;40 1999; MasPar time 7,92 £ 

830,441 Million cell updates/sec 



Title: 

Description: 
Perfect Score; 
Sequence: 

Scoring table; 
Searched; 



>CS-09-162-597-3 

(1-245) from OS09162597.pep 

1940 

1 MTLFHFGNCFALAYFPYFIT GLVSQTLMYLFPASLQVLVK 245 

PAM 150 
Gap 11 

74019 seqs, 26840295 residues 



Post-processing; Minimum Match 0% 

Listing first 45 summaries 



swiss-prot35 
l;swissprot 



istics: Mean 47,613; Variance 90,624; scale 0.525 

Pred. No, is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SDMMARIES 



Result 



Query 



No. Score 


Match Length D 


3 ID 


Description 




Pred. No. 


1 118 


6.1 572 


PTIJIYCPN 


PHOSPHOENOLPYROVATE-PR 


2.41e-03 


2 113 


5.8 215 


CYB6_0RYSA 


CYTOCHROME B 


(EC 1,10 


1.21e-02 


3 113 


5.8 215 


CYB6J1AIZE 


CYTOCHROME B 


(EC 1.10 


1.21e-02 


4 113 


5.8 215 


CYB6_WHEAT 


CYTOCHROME B 


(EC 1.10 


1.21e-02 


5 ill 


5.7 215 


CYB6_T0BAC 


CYTOCHROME B 


(EC 1.10 


2.27e-02 


6 106 


5.5 167 


NU6M_APIME 


NADH-OBIQDINONE OXIDOR 


1.07e-01 


7 107 


5.5 215 


CyB6_SYNP7 


CYTOCHROME B 


(EC 1.10 


7.89e-02 


8 107 


5.5 215 


CYB6_CHLRE 


CYTOCHROME B 


(EC 1.10 


7.89e-02 


9 106 


5.5 215 


CYB6_N0SSP 


CYTOCHROME B 


(EC 1.10 


1.07e-01 


10 106 


5.5 215 


CYB6_SPI0L 


CYTOCHROME B 


(EC 1.10 


1.07e-01 


11 105 


5.4 215 


CYBejARPO 


CYTOCHROME B 


(EC 1.10 


1.45e-01 


' 12 104 


5.4 215 


CYB6_0D0SI 


CYTOCHROME B 


(EC 1.10 


1.97e-01 


13 104 


5.4 215 


CYB6_P0RPU 


CYTOCHROME B 


(EC 1.10 


1.97e-01 


14 105 


5.4 222 


CYB6_SYlfP2 


CYTOCHROME B 


(EC 1.10 


1.45e-01 


15 103 


5.3 215 


CYB6_CYAPA 


CYTOCHROME B 


(EC 1.10 


2.66e-01 


16 103 


5.3 222 


CYB6_PR0H0 


CYTOCHROME B 


(EC 1.10 


2.66e-01 


17 103 


5.3 222 


CyB6_SYNY3 


CYTOCHROME B 


(EC 1,10 


2.66e-01 


18 102 


5,3 552 


CYHR_CANMA 


CYCLOHEXIMIDE RESISTAN 


3.58e-01 


19 101 


5.2 215 


CYB6_PINTH 


CYTOCHROME B 


(EC 1.10 


4.81e-01 


20 101 


5.2 365 


NQ08_THETH 


NADH-OBIQDINONE OXIDOR 


4.81e-01 


21 101 


5.2 430 


RFBXJALTY 


POTATIVE 0- ANTIGEN TRA 


4.81e-01 


22 101 


5.2 570 


PTIJACSU 


PHOSPHOENOLPYRDVATE-PR 


4.81e-01 


23 99 


5.1 215 


CYB6.CHLVU 


CYTOCHROME B 


(EC 1,10 


8.64e-01 



24 


99 


5.1 215 


I 


CYB6 CHLPR 






25,.,,, 


....98, 


5,1, . 422- 


I 


SYT2..RAT 


■SYNAPTOTAGMIN II, 


l-.15e+00> 


26 


' 98 


5,1 422 


1 


SYT2_M0DSE 


SYNAPTOTAGMIN II, 


1.15e+00 


27 


98 


5,1 573 


1 


PTlJffCCA 


PHOSPHOENOLPYRUVATE-PR 


1.15e+00 


28 


98 


5,1 592 


1 


0DP2_DICDI 


DIHYDROLIPOAMIDE ACETY 


1.15e+00 


29 


97 


5.0 304 


1 


G5PI CALFI 


GLUCOSE-6'PHOSPHATE IS 


1.54e+00 


30 


97 


5,0 601 


I 


KEFB ECOLI 


GLUTATHIONE"REGULATED 


1.54e+00 


31 


96 


4.9 379 


I 


CYB CRAGR 


CYTOCHROME B (EC 1.10. 


2.04e+00 


32 


95 


4,9 379 


I 


CYB CRAGY 


CYTOCHROME B (EC 1,10, 


2,71e+00 


33 


95 


4,9 775 


I 






9 71p+flfl 


34 


96 • 


4,9 1323 


1 


HST6 CANAL 


ATP-DEPENDENT PERMEASE 


2.04e+00 


35 


95 


4,9 1353 


I 






9 71o+(in 


36 


94 


4.8 151 


\ 


Y4JR RHISN 


HYPOTHETICAL 17.3 KD P 


3.59e+00 


37 


94 


4.8 342 


1 


ISIA_SYNY3 


IRON-STRESS INDUCED CH 


3^596+00 


38 


94 


4,8 379 


1 


CYB.CRAGG 


CYTOCHROME B (EC 1,10, 


3.59e+00 


39 


93 


4,8 382 


1 


LPXBJCOLI 


LIPID-A-DISACCHARIDE S 


4,74e+00 


40 


94 


4,8 491 


1 


Y225JIYCPN 


HYPOTHETICAL PROTEIN M 


3.59e+00 


41 


94 


4.8 578 


1 


YFCOJCOLI 


HYPOTHETICAL OUTER MEM 


3.59e+00 


42 . 


_..91,.... 


.4, 8. .......537 


.1 


YA07.SCHP,0. 


PUTATIVE. 73.,.4..KD TRANS 


Lum. 


43 


94 


4,8 842 


1 


AMPN_LACDL 


AMINOPEPTIDASE N (EC 3 


3.59e+00 


44 


93 


4,8 1035 


1 


EPA6_M00SE 


EPHRIN TYPE-A RECEPTOR 


4.74e+00 


45 


94 


4,8 2470 


1 


T0R1.YEAST 


PHOSPHATIDYLINOSITOL 3 


3.59e+00 



RESULT 
ID 
AC 



1 



572 AA. 



PTIJIYCPN STANDARD; 
P75168; 

Ol-NOV-1997 (REL. 35,^ 

Ol-NOV-1997 (REL. 35, LAST SEQUENCE UPDATE) 
Ol-NOV-1997 (REL. 35, LAST ANNOTATION UPDATE) 
PHOSPHOENOLPYRUVATE- PROTEIN PHOSPHOTRANSFERASE (EC 2,7.3,9) 
(PHOSPHOTRANSFERASE SYSTEM, ENZYME I). 
PTSI. 

MYCOPLASMA PNEUMONIAE. 

PROKARYOTA; TENERICUTES; MOLLICOTES; MYCOPLASMA; MYCOPLASMATALES; 

MYCOPLASMATACEAE. 

[1] 

SEQUENCE FROM N.A. ■ 
STRAIN=ATCC 29342 / M129; 
MEDLINE; 97105885. 

R., HILBERT H., PLAGENS H., PIRKL E., LI B.-C, 

R.; 

NUCLEIC ACIDS RES, 24:4420-4449(1996). 

-I- FUNCTION: THIS IS A COMPONENT OF THE PHOSPHOENOLPYRUVATE-DEPENDENT ■ 
SUGAR PHOSPHOTRANSFERASE SYSTEM (PTS), A MAJOR CARBOHYDRATE ACTIVE 
-TRANSPORT SYSTEM. ENZYME I TRANSFERS THE PHOSPHORYL GROUP FROM 
PHOSPHOENOLPYRUVATE (PEP) TO THE PHOSPHORYL CARRIER PROTEIN (HPR), 
ENZYME I IS COMMON TO ALL PTS (BY SIMILARITY). 

•!- CATALYTIC ACTIVITY: PHOSPHOENOLPYRUVATE + PROTEIN HISTIDINE - 
PYRUVATE + PROTEIN N-PHOSPHOHISIIDINE. 

-I- SUBUNIT: HOMODIMER (BY SIMILARITY), 

- ! - SUBCELLULAR LOCATION ; CYTOPLASMIC . 

■!- SIMILARITY: TO OTHER PEP-UTILIZING ENZYMES. i 

EMBL; AE000021; G1673880; -. 

PROSITE; PS00370; PEP ENZYMES PHOS SITE; 1, 

PROSITE; PS00742; PEPJNZYMESJ; 1. 

PHOSPHOTRANSFERASE SYSTEM; TRANSFERASE; KINASE; SUGAR TRANSPORT; 
PHOSPHORYLATION. 

MODJES 190 190 PHOSPHORYLATION (BY SIMILARITY) . 
572 AA; 63950 MW; 24ED53DF CRC32; 



Query Match 6.1*; Score 118; DB 1; Length 572; 

Best Local Similarity 28,8*; Pred, No. 2.41e-03; 

Matches 23; Conservative 26; Mismatches 25; Indels 6; Gaps 6; 

■Db 446 SIGTN-DLIQYSFAADRMNKNVSY-LYQPLNPALLKLIYLTIEGGKVNDIWTGMCGEMAG 503 

II ;| l;:l I: :: : I l|; ; ||:| |:;|:: : | ; :|: ::: 
Qy 136 SIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFLSVYKAFVMEIFVHLCS-LGS 194 

■ Db 504 -EPL-AIPLLLGL-GLKEFS 520 

I I II :|I h 
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Oy 195 WARLDARAWTGLLALKHFG 214 



RESULT 2 

ID CYBS.ORYSA STANDARD; PRT; 215 AA. • 

AC P12123; 

DT Ol-OCT-1989 {REL, 12, CREATED) 

DT Ol-OCT-1989 (REL. 12, LAST SEQUENCE UPDATE) 

DT' Ol-NOV-1990 (REL. 15, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1,10,99,1), 

GN PETB. 

OS ORYZA SATIVA (RICE), 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE; 

OC CYPERALES; GRAMINEAE, 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-CV. NIPPONBARE; 

RA SUGIURAM,; 

JL SUBMITTED (JDL-1989) TO EMBL/GENBANK/DDBJ DATA BANKS, 

RN [2] 

RP COMPLETE GENOME. 

RX MEDLINE; 89364698, 

RA HIRATSUKA J., SHIMADA H., WHITTIER R., ISHIBASHI T., SAKAMOTO M., 

RA MORI M., KONDOC, HONJI Y,, SUNC.-R,, MENG B.-Y., LI Y,-Q,, 

RA RANNO A,, NISHIZAWA Y., HIRAI A., SHINOZAKI K,, SUGIURA M,; 

RL MOL, GEN, GENET. 217:185-194(1989). 

RN [3] - - 

RP SEQUENCE FROM N.A, 

RA COTE J,C,, WU N.H,, WU R,; 

RL PLANT MOL, BIOL. 11:873-874(1988), 

CC •!- FUNCTION: COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
CC OF THE CHLOROPLASTIC RESPIRATORY CHAIN. 

CC -!- CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN = 
CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 

CC •!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN, 

CC -I- SUBONIT; THE MAIN SUBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B6, 

CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME P AND THE RIESKE PROTEIN. 

CC -!- SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B, 

DR EMBL; X15901; G669082; -, 

DR EMBL; M35995; G343209; -, 

DR PIR; JQ0256; CBRZ6. 

DR .MENDEL; 4313; 0RYSA;PETB;1. 

DR PROSIIE; PS00192; CYTOCHROME JJEME; 1. 

KW ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE. 



FT 


METAL 


86 


86 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY), 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY), 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 




202 




(BY SIMILARITY) , 


FT 


METAL 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY), 


SQ 


SEQUENCE 


215 AA; 


24182 MW 


086A69C5 CRC32;, 



Query Match 5,8%; 
Best Local Similarity 23,5%; 
Matches 20; Conservative 



Score 113; DB 1; Length 215; 

Pred, No, 1.216-02; 

27; Mismatches 32; indels 



Gaps 



57 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :: :| |;||:: I |:M : I :::: |:|:| |:: :: | 
132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

114 REL-TWVTGWLAVLTASFGVTGYS 137 

I :| l|:|: ::: :: 
190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 3 

ID CYB6J1AIZE STANDARD; PF 
AC P05642; 

DT Ol-NOV-1988 (REL. 09, CREATED) 



Ol-NOV-1988 (REL. 09, LAST SEQUENCE UPDATE) 
Ol-FEB-1995 (REL, 33, LAST ANN0TAT,3;0N UPDATE) 
CYTOCHROME B6 (EC 1,10,99,1) . 
PETB. 

ZEA MAYS (MAIZE) , 
CHLOROPLAST. 

EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE; ' 
CYPERALES; GRAMINEAE. 
[1] 

SEQUENCE FROM N.A, 
MEDLINE; 88210525, 
ROCK CD,, BARKAN A,, TAYLOR W,C.; 
CURR. GENET. 12:69-77(1987), 
[2] 

SEQUENCE FROM N.A, 
MEDLINE; 95395841, 

MAIER R.M., NECKERMANN K,, IGLOI G.L., KOESSEL H,; 
J, MOL. BIOL, 251:614-628(1995), 
I- FUNCTION: COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 

OF THE CHLOROPLASTIC RESPIRATORY CHAIN. 
I- CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN = 

PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 
!- COFACTOR: TWO HEME GROUPS (B562 AND B556) WHICH ARE NOT COVALENTLY 

BOUND TO THE PROTEIN, 
!- SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B6, 
17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN, 
I- SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B. 
....„.,.DR,.-.EMBL; X05422; G311718; -, 

DR EMBL; X86563; G902251; ALTJEQ, 
DR PIR; S08592; CBZM6R, 
DR MAIZEDB; 55334; -, 

MENDEL; 4315; ZEAMA;PETB;1. 
PROSITE; PS00192; CYTOCHROME JJEME; 1. 

ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE. 



DR 



KW 



FT 


METAL 


86 


85 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY), 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY), 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY), 


SQ 


SEQUENCE 


215 AA; 


24180 MW 


CBAB4737 CRC32; 



Query Match 5,8%; Score 113; DB 1; Length 215; 

Best Local Similarity 23,5%; Pred. No. l,21e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6 

Db 57„YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :; ;| hlh; I hi I : I :::: |:|:| |:: :: | 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 



, Db 



114 REL-TWVTGWLAVLTASFGVTGYS 137 

I ;| Il:|: ::: :: 
190 CSLGSWARLDARAWTGLLALKHFG 214 



4 



CYB6_WHEAT STANDARD; PRT; 215 AA. 
P12362; 

Ol-OCT-1989 (REL. 12, CREATED) 

ol-oct-1989 (rel. 12, last sequence update) 
ol-dec-1992 (rel, 24, last annotation update) 
cytochrome b6 (ec 1.10.99 ,1) , 
petb; 

TRITICUM AESTIVDM (WHEAT), AND HORDEUM VULGARE (BARLEY), 
CHLOROPLAST, ■ 

EUKARYOTA; PLANTA; EMBRYOPHYTA;! ANGIOSPERMAE; MONOCOTYLEDONEAE; 
CYPERALES; -GRAMINEAE. , . ' 

SEQUENCE FROM N.A. , 
SPECIES-WHEAT; STRAIN-CV. MARDLER; 
MEDLINE; 91329710. 
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HIRD S.M., WILSON R.J., DYER T.A., GRAY J.C.; 

PLANT MOL. BIOL. 16;745-I47a9.91) . . 

[2] 

SEQUENCE FROM N.A. 

SPECIES-BARLEY; STRAIN-CV. SABARLIS; 
MEDLINE; 89240047. 

ANDREEVA A.V., BORYAKOVA A.A. , REVERDATTO S.V, , CHARHMAKHCHEVA O.G. , 
EFIMOV V.A.; 

NUCLEIC ACIDS RES. 17:2859-2860(1989). 
[3] 

SEQUENCE FROM N.A. 

SPECIES-BARLEY; STRAIN-CV. SABARLIS; 
MEDLINE; 92207253. 

EFIMOV V.A., ANDREEVA A.V,, REVERDATTO S.V./ CHAKHMAKHCHEVA O.G.; 
BIOORG. KHIM. 17:1369-1385(1991). 

-!- FUNCTION: COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 

OF THE CHLOROPLASTIC RESPIRATORY CHAIN, 
-!- CATALYTIC ACTIVITY; PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN - 

PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 
■!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 

BOUND TO THE PROTEIN. 
-!- SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B6, 

17 KD POLYPEPTIDE (PETD), CYTXHROME F AND THE RIESKE PROTEIN. 
•!• SIMILARITY; WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B, 
EMBL; X54749; G578659; -, 
,EMBL; X54751; G12363; -. 
EMBL; X14107; E274048; -. 
PIR; S09186; S09186. 
PIR; S04149; S04149. 
PIR; S14961; S14961. 
MENDEL; 4314; TRIAE;PETB;1. 
PROSITE; PS00192; CYTOCHROMEJJEME; 1. 

ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE. 



FT 


METAL 


86 


86 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) . 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) . 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) . 


FT 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) . 


SQ 


SEQUENCE 


215 AA; 


24166 


MW; 0178C9B8 CRC32; 



Query Matcli 5.8%; 
Best Local Similarity 23,5*; 
Matches 20; Conservative 



Score 113; DB 1; Length 215; 

Pred. No. 1.21e-02; 

27; Mismatches 32; Indels 6; 



Db 57 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :: :l l:||:: I hll : I :;:: |:|:| |:: ;: I 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWVTGWLAVLTASFGVTGYS 137 

I :l ;:: :: 

Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 5 

ID CYB6_T0BAC STANDARD; PRT; 215 AA. 

AC P06247; 

DT Ol-JAN-1988 (REL, 06, CREATED) 

DT, Ol-JAN-1988 (REL, 06, LAST SEQUENCE UPDATE) 

DT' Ol-APR-1990 (REL. 14, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1,10.99.1). 

GN PEIB. 

OS NICOTIANA TABACUM (COMMON TOBACCO) . 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE; 

OC SOLANALES; SOLANACEAE, 

RN [1] 

RP SEQUENCE FROM N.A, 

RC STRAIN-CV, BRIGHT YELLOW 4; 

RA SDGIURAM.; 

RL SUBMITTED (AUG-1986) TO EMBL/GENBANK/DDBJ DATA BANKS. 



FT 


METAL 


86 


86 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


1 FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


j FT 








(BY SIMILARITY), 


! FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


'■ FT 








(BY SIMILARITY) . 


FT 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


SQ 


SEQUENCE 


215 AA; 


24136 


HW; F4BE8F51 CRC32; 



[2] 

...COMPLETE-GENOME - 

SHINOZAKI K., OHME M., TANAKA M., WAKASOGI T,, HAYASHIDA N,, 
MATSUBAYASHI T., ZAITA N,, CHUNWONGSE J., OBOKATA J., 
YAMAGUCHI-SHINOZARI K,, OHTO C, TORAZAWA K., MENG B.Y., SUGITA M., 
DENO H., KAMOGASHIRA T., YAMADA K,, KUSUDA J,, TAKAIWA F., KATO A,, 
TOHDOH N,, SHIMADA H., SUGIURA M.; 
EMBO J. 5:2043-2049(1985), 



-!- f 



-1- 



FUNCTION; COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
OF THE CHLOROPLASTIC RESPIRATORY CHAIN. 
CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN - 
PLASTOQUINONE i 2 REDUCED PLASTOCYANIN. 
■!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 

BOUND TO THE PROTEIN, 
-!- SUBUNIT; THE MAIN SUBUNITS OF COMPLEX B5-F ARE: CYTOCHROME B6, 
17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN, 
-!- SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B, 
EMBL; Z00044; G11858; -, 
PIR; A00162; CBNT5. 

PROSITE; PS00192; CYTOCHROMEJJEME; 1. 

ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE. 



Query Match 5.7%; Score 111; DB 1; Length 215; 

Best Local Similarity 22.4%; Pred. No. 2,27e-02; 

Matches 19; Conservative 28; Mismatches 32; Indels 6; Gaps 6: 



Db 57 YYRPTVTEAFASVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFRKP 113 

I ; I hi I : I hhl h: ;; I 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWVTGWLAVLTASFGVTGYS 137 

I ;| Ihl: ::: 
Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



NU6MJPIME STANDARD; PRT; 167 AA. 
P34857; 

Ol-FEB-1994 (REL. 28, CREATED) 
Ol-FEB-1994 (REL, 28, LAST SEQUENCE UPDATE) 
Ol-NOV-1995 (REL, 32, LAST ANNOTATION UPDATE) 
NADH-OBIQUINONE OXIDOREDDCTASE CHAIN 6 (EC 1,6,5,3). 
ND6, 

APIS MELLIFERA (HONEYBEE), 
MITOCHONDRION. 

EUKARYOTA; METAZOA; ARTHROPODA; INSECTA; HYMENOPTERA. 
[1] 

SEQUENCE FROM N,A, 
MEDLINE; 93114603. 
CROZIER R.H,, CROZIER Y.C.; 
GENETICS 133:97-117(1993). 

-!- CATALYTIC ACTIVITY: NADH + UBIQUINONE = NAD(+) + UBIQUINOL. 
EMBL; L06178; G829009; -. 
OXIDOREDUCTASE; NAD; UBIQUINONE; MITOCHONDRION. 
167 AA; 19971 MW; 36B64E1B CRC32; 



Query Match 5.5%; Score 106; DB 1; Length 167; 

Best Local Similarity 29.9%; Pred. No. 1.07e-01; 

Matches 20; Conservative 17; Mismatches 25; Indels 5; Gaps £ 

Db 11 IFMSSLISMILTIY-LNNIFNSPSMLLIYLISYSIYMSLMMFT-MCSMNSLLILMILIVF 68 

I I; ; II I I I : I ;: :|h:| I ;: : I :||: I I I 
Qy 147 IVASAQVWMI-TRYDLYHNFRPAVLLLMFLSVYKAFV-METFVHLCSLGSWARLDARAV- 203 



Best Available Copy 

Sun Jan 24 11:29:03 1999 US-09-162-597-3,rsp Page 4 



Db 69 LSGMLIM 75 

::|:| : 
Qy 204 VTGLLAL 210 



RP 'SEQUENCE FROM N,A, 

RC STRAIN-137C; 

RX MEDLINE; 91285146. 

RA BUESCHLEN S,, CHOQtJET Y,, KURAS R., WOLLMAN F,A,; 

RL FEES LETT, 284 :257-262(1991) . 

RN [3] 

RP CHARACTERIZATION, 

RC STRAIN-WT12; 

RX MEDLINE; 96094329. 

RA PIERRE Y,, BREYTON C, KRAMER D., POPOT J,-L.; 

RL J, BIOL. CHEM, 270;29342-29349(1995) . 

CC •!• FUNCTION: COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
CC OF THE CHLOROPLASTIC RESPIRATORY CHAIN, 

CC ■!■ CATALYTIC ACTIVITY: PLASTOQOINOL-1 + 2 OXYDIZED PLASTOCYANIN - 
CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN, 

CC -!■ COFACTOR: TWO HEME GROUPS (B562 AND B556) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN. 

CC ' - !- SUBUHIT; THE MAIN StJBUNITS OF COMPLEX B5-F ARE: CYTOCHROME B6, 
CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN, 

CC ■!• PTM: THE N-TERMINAL IS BLOCKED, 

CC -!• SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B. 

DR EMBL; X62905; G12499; -. 

DR .EMBL; X72918; G288909; -, 

DR PIR; S16917; S16917. 

DR PIR; S21253; S21253, 

DR MENDEL; 4312; CHLRE;PETB;1. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 

KW ELECTRON-TRANSPORT; -HEME; .CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE.'.- 

FT METAL 86 86 IRON 1 (HEME AXIAL LIGAND) 

FT (BY SIMILARITY). 

FT METAL 100 100 IRON 2 (HEME AXIAL LIGAND) 

FT (BY SIMILARITY). 

FT METAL ■ 187 187 IRON 2 (HEME AXIAL LIGAND) 

FT (BY SIMILARITY). 

FT METAL 202 202 IRON 1 (HEME AXIAL LIGAND) 

FT (BY SIMILARITY). 

SQ SEQUENCE 215 AA; 24164 MW; E18A90C1 CRC32; 



RESULT 7 

ID CYB6_SYNP7 STANDARD; PRT; 215 AA, 

AC Q54711; 

DT' Ol-NOV-1997 (REL. 35, CREATED) 

DT Ol-NOV-1997 (REL. 35, LAST SEQUENCE UPDATE) 

DT Ol-NOV-1997 (REL, 35, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1.10.99,1). 

GN PETB, 

OS SYNECHOCOCCUS SP, (STRAIN PCC 7942) (ANACYSTIS NIDULANS R2). 

OC PROKARYOTA; GRACILICOTES; OXYPHOTOBACTERIA; 

OC CYANOBACTERIA (BLUE-GREEN ALGAE); CHROOCOCCALES . 

RN [1] 

RP SEQUENCE FROM N.A. 

RA NEWNAN J., KARAKAYA H., SCANLAN D.J., CARR N.G., MANN N.H,; 

RL SUBMITTED (OCT-1995) TO EMBL/GENBANK/DDBJ DATA BANKS, 

CC -I- FUNCTION: COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 

CC OF THE CYANOBACTERIAL RESPIRATORY CHAIN. 

CC -!- CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN - 

CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN, 

CC -!■ COFACTOR: TWO HEME GROUPS (B552 AND B566) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN. 

CC -!• SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B6-F ARE; CYTOCHROME B6, 
CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN, 

CC -!- SIMILARITY; WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B, 

DR EMBL; U33285; G9882B9; -, 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 



KW 


ELECTRON TRANSPORT 


HEME; 


PHOTOSYNTHESIS; TRANSMEMBRANE. 


FT 


METAL 


86 


86 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


SQ 


SEQUENCE 


215 AA; 


24357 


MW; 38B90BD7 CRC32; 



Query Match 5.5»; Score 107; DB 1; Length 215; 

Best Local Similarity 23.7%; Pred. No. 7,89e-02; 



Ma 


tches 


18; Conservative 20; Mismatches 32; Indels 5; Gaps 


Db 


57 


YYKPTVAEAYSSVQFIMNQVNFGWLI-R-SI-HRWSASMMVLMMILHVFRVYLIGGFKRP 113 
1 :; 1 |:M ; 1 ;:;: |:|;| I;; ;: | ; 
YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 


Qy 


■ 132 


Db 


114 


REL-TWVTGWMAVIT 128 

1 :| 11:1 
CSLGSWARLDARAWT 205 


Qy 


190 



RESULT 8 

ID 'CYBe.CHLRE STANDARD; PRT; 215 AA. 

AC Q00471; 

DT Ol-DEC-1992 (REL. 24, CREATED) • 

DT Ol-DEC-1992 (REL. 24, LAST SEQUENCE UPDATE) 

DT Ol-NOV-1997 (REL. 35, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1.10.99.1) . 

GN PETB. 

OS CHLAMYDOMONAS REINHARDTII, 

OG CHWROPLAST. 

OC EOKARYOTA; PLANTA; PHYCOPHYTA; CHLOROPHYTA (GREEN ALGAE); 

OC CHLOROPHYCEAE; VOLVOCALES; CHLAMYDOMONADACEAE , 

RN [1] 

RP SEQUENCE FROM N, A. 

RX MEDLINE; 92256821. 

RA HUANG C, LIU X,-Q,; 

RL PLANT MOL. BIOL. 18:985-988(1992). 

s. RN [2] 



Query Match 5.5%; Score 107; DB 1; Length 215; 

Best Local Similarity 22,7%; Pred. No. 7.89e-02; 

Matches 15; Conservative 21; Mismatches 24; Indels 6; Gaps 6; 

Db 57 YYRPTVAEAFASVQYIMTDVNFGWLI-R-SI-HRWSASMMVLMMVLHVFRVYLTGGFKRP 113 

I ; :: :; |:||:: |;| I : I :::; |:| I |:; :: | : 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TW 118 
I :| 

Qy 190-CSLGSW 195 



RESULT 9 

ID ,CyB6jOSSP standard; PRT; 215 AA. 
•■'-'!' AC" P12122,; 

.DT Ol-OCT-1989 (REL. 12, CREATED) 
' 1 DT Ol-OCT-1989 (REL, 12, LAST SEQUENCE UPDATE) 
il DT , Ol-APR-1990 (REL, 14, LAST ANNOTATION UPDATE) 
DE CYTOCHROME B6 (EC 1,10,99.1). 
GN PETB. 

OS NOSTOC SP. (STRAIN PCC 7905). 
OC PROKARYOTA; GRACILICUTES; OXYPHOTOBACTERIA; 
OC . CYANOBACTERIA ( BLUE -GREEN ALGAE ) ; NOSTOCALES . 
RN [1]. 

RP SEQUENCE FROM N,A. 
■ RX MEDLINE; 89008280. ' • . 

RA KALLAS T., SPILLER S., MALKIN R.; ■ 
RL J. BIOL, CHEM. 263:14334-14342(1988). 

CC ■'■:!- FUNCTION; COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
CC . . '. OF THE CYANOBACTERIAL RESPIRATORY CHAIN. 
CC . -!- CATALYTIC ACTIVITY; PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN ■ 
CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 

CC -!- COFACTOR; TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 
CC , BOUND TO THE PROTEIN, ' '■. 



Best Available Copy 

Sua Jan 24 11:29:03 1999 US-09-162-597-3.rsp Page 5 



cc 


■!• SUBONIT; THE MAIN SDBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B5, 


FT 


METAL 


187 


cc 


17 RD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN. 


' ■ ' 'FT 


.METAL... 


202. . 


cc 


-!• SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B. 


FT 


CONFLICT 


108 


DR 


EMBL; J03967; G145023; -. 


FT 


CONFLICT 


188 


DR 


PIR; A30807; A30807. 


SQ 


SEQOENCE 


215 AA 



PROSITE; PS00192; CYTOCHROME JJEME; 1. 
KW ELECTRON TRANSPORT; HEME; PHOTOSYNTHESIS; TRANSMEMBRANE, 



FT METAL 
FT 



86 86 



IRON 1 (HEME AXIAL LIGAND) 
(BY SIMILARITY). 
IRON 2 (HEME AXIAL LIGAND) 
(BY SIMILARITY). 
IRON 2 (HEME AXIAL LIGAND) 
(BY SIMILARITY). 
IRON 1 (HEME AXIAL LIGAND) 
(BY SIMILARITY), 
SQ SEQOENCE 215 AA; 24259 MW; DEAE4993 CRC32; 



FT METAL 
FT 

FT METAL 
FT 

FT METAL 
FT 



100 100 
187 187 



202 202 



Query Match 5.51; 
Best Local Similarity 25,0%; 
Matches 19; Conservative 



Score 106; DB 1; Length 215; 

.Pred,Jo...l.07e-01;„. 

18; Mismatches' 33;" Indels 6;'" 



' 57 YYKPTVAEAFSSVEYIMNEVNFGWLI-R-SI-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I :: ;l I II: |:| I : I :::: |:|:| |:: ;: | 
132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

114 REL-TWVSGVILAVIT 128 

I :l ll;| 
190 CSLGSWARLDARAWT 205 



RESULT 10 

ID CYB6_SPI0L STANDARD; PRT; 215 AA. 

AC P00165; 

DT 21-JUL-1986 (REL. 01, CREATED) 

DT Ol-NOV-1988 (REL. 09, LAST SEQUENCE UPDATE) 

DT Ol-APR-1990 (REL. 14, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1.10.99,1), 

GN PETB, 

OS SPINACIA OLERACEA (SPINACH), 

OG CHLOROPLAST, 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE; 

OC CARYOPHYLLALES; CHENOPODIACEAE, 

RN [1] 

RP SEQUENCE FROM N,A, 

RX MEDLINE; 88194670, 

RA WESTHOFF P,, FARCHAOS J,W., HERRMANN R,G.; 

RL CURR. GENET. 11:165-169(1985), 

RN [2] 

RP SEQUENCE FROM N.A. 

RA HEINEMEYER W., ALT J., HERRMANN R.G.; 

RL CURR. GENET. 8:543-549(1984), 

RN [3] 

RP SEQUENCE. 

RX MEDLINE; 84144766, 

RA WIDGER W.R,, CRAMER W.A., HERRMANN R.G., TREBST A.; 

RL PROC, NATL. ACAD. SCI. U.S.A. 81:674-678(1984). 

CC - ■!■ FUNCTION; COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 

CC OF THE CHLOROPLASTIC RESPIRATORY CHAIN, 

CC ■!- CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN - 

CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN, 

CC -I- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 
CC ' BOUND TO THE PROTEIN. 

CC -I- SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B6, 

CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN, 

CC -I- SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B. 

DR EMBL; X07105; G12285; •, 

DR PIR; A00161; CBSP6. 

DR PIR; S00429; S00429. 

DR MENDEL; 2553; SPI0L;PETB;1. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 

KW ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE, 

FT METAL 86 86 IRON 1 (HEME AXIAL LIGAND) . 

FT METAL 100 100 IRON 2 (HEME AXIAL LIGAND) . 



187 IRON 2 (HEME AXIAL LIGAND) . 
.202 ™, .IRONX.(.HEME. AXIAL. LIGAND),. . , 
109 MISSING (IN REF, 3), 

188 MISSING (IN REF. 3). 
215 AA; 24166 MW; 35EE50AD CRC32; 

Query Match 5.5*; Score 106; DB 1; Length 215; 

Best Local Similarity 22.7%; Pred. No, l,07e-01; 

Matches 15; Conservative 21; Mismatches 24; Indels 6; Gaps ( 

Db 57 YYRPTVTDAFASVQYIMTEVNFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFRKP 113 

I ; ;; ;: I:||:; I;| I : I |:|:| |:: :: I 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TW 118 

I :| 

Oy 190 CSLGSW 195 



PRT; 215 AA, 



, UMESONO K,, INOKUCHI H., 



RESULT U 

ID CYB6J1ARP0 STANDARD; 

AC P06248; 

DT Ol-JAN-1988 (REL, 06, CREATED) 

DT Ol-JAN-1988 (REL, 06, LAST SEQUENCE UPDATE) 

DT Ol-MAR-1992 (REL, 21, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1.10,99,1), 

GN PETB, 

OS MARCHANTIA POLYMORPHA (LIVERWORT) , 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; BRYOPHYTA; HEPATICOPSIDA. 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE; 89068687. 

RA FUKUZAWA H., KOHCHI T., SANO T., SHIRAI E 

RA OZERI H., OHYAMA K.; 

RL J, MOL. BIOL, 203:333-351(1988), 

RN [2] 

RP COMPLETE GENOME. 

RA OHYAMA K., FUKUZAWA H., KOHCHI T., SHIRAI H., SANO T,, SANO S,, 

RA UMESONO K,, SHIKI Y., TAKEUCHI M,, CHANG Z., AOTA S., INOKUCHI H., 

RA OZERI H.; 

RL NATURE 322:572-574(1986). 

RN [3] 

RP SEQUENCE FROM N.A, 

RA FUKUZAWA H., YOSHIDA T,, ROHCHI T., OKUMURA T., SAWANO Y., OHYAMA K.; 

RL FEES LETT, 220:61-56(1987). 

CC -I- FUNCTION: COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
CC OF THE CHLOROPLASTIC RESPIRATORY CHAIN. 

CC •!• CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN - 
CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 

CC •!- COFACTOR: TWO HEME GROUPS (B562 AND B556) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN. 

CC •!• SOBUNIT: THE MAIN SUBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B5, 

CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN. 

'CC •!• SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B. 

DR EMBL; X04465; E122080; -. 

DR PIR; A00163; CBLV6. 

DR PIR; S01552; S01552. 

DR PIR; S02432; S02432, 

DR MENDEL; 2551; MARP0;PETB;1. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1. 

KW ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE. 



FT 


METAL , 


86 


85 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) . 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) , 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) , 


FT 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) , 


SQ 


SEQUENCE 


215 AA; 


24307 MW; 


A8EDE412 CRC32; 



Query Match 



5,4%; Score 105; DB 1; Length 215; 



Best Available Copy \ 

Sun Jan 24 11:29:03 1999 US-09-162-597_-3.rsp Page 6 



Best Local Similarity 22,7%; Pred, No. 1.45e-01; 

Matches 15; Conservative 21; Mismatches 24; indels 6; Gaps ,„„5; 

Db 57 YYRPTVTEAFSSVQYIMTEVHFGWLI-R-SV-HRWSASMMVLMMILHIFRVYLTGGFKKP 113 

I : ;: :l |:||:: hi I ; I |:|:| ;:; :; I 

Qy 132 nQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TW 118 

I ;l 

Qy 190 CSLGSW 195 



RESULT 12 

ID CYB6_0D0SI STANDARD; PRT; 215 AA, 

AC P49488; 

DT Ol-FEB-1996 (REL. 33, CREATED) 

DT Ol-FEB-1996 (REL. 33, LAST SEQUENCE UPDATE) 

DT Ol-OCT-1996 (REL. 34, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1.10.99,1). 

ON PETB. 

OS ODONTELLA SINENSIS. 

OG CHLOROPLAST. 

00 EUKARYOTA; PLANTA; PHYCOPHYTA; BACILLARIOPHYTA (DIATOMS). 

RN [1] 

RP SEQUENCE FROM N.A. 

RA KOWALLIK K.V., STOEBE B,, SCHAFFRAN I,, KROIH-PANCIC P., FREIER U,; 

RL PLANT MOL. BIOL. REP. 13:336-342(1995), 

CC -I- FUNCTION; COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
CC OF THE CHLOROPLASTIC RESPIRATORY CHAIN. 

CC - ! • CATALYTIC ACTIVITY : PLASTOQUINOL-'r + 2' OXYDIZED' PLASTOCYANIN '= " ' ' 
CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 

CC -I- COFACTOR: TWO HEME GROUPS (B562 AND B556) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN. 

CC -I- SUBUNIT; THE MAIN SUBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B6, 
CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN. 

CC -I- SIMILARITY; WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B. 

DR EMBL; Z57753; E211859; -. 

DR MENDEL; 2552; ODOSI;PETB;1, 

DR PROSITE; PS00192; CYTOCHROMEJJEME; FALSEJEG. 

KW ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE. 



FT 


METAL 


86 


86 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) . 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) , 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) , 


FT 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 


SEQUENCE 






(BY SIMILARITY) . 


SQ 


215 AA; 


23923 MW 


79C43185 CRC32; 



Query Match 5.4%; Score 104; DB 1; ' Length 215; 

Best Local Similarity 26.3%; Pred, No. 1.97e-01; 

Matches 20; Conservative 19; Mismatches 31; Indels 6; Gaps 6; 

Db 57 YYRPSWDAFASVEYIMTSVNFGWLI-R-SI-HRWSASMMVMMMVLHVFRVYLTGGFKKP 113 ' 

1:1: ;: I |:| I : I :::: ::| I |:; :: I 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWVTGVILAWT 128 

I :i nil 

Qy 190 CSLGSWARLDARAWT 205 



RESULT 13 



ID 


CYB6_P0RPU STANDARD; PRT; ' 215 AA. 


AC 


P51341; 


DT 


Ol-OCT-1996 (REL. 34, CREATED) 


DT 


Ol-OCT-1996 (REL. 34, LAST SEQUENCE UPDATE) 


DT 


Ol-OCT-1996 (REL. 34, LAST ANNOTATION UPDATE) 


DE 


CYTOCHROME B6 (EC 1.10.99,1). 


GN 


PETB, 


OS 


PORPHYRA PURPUREA. 


V OG 


CHLOROPLAST. 



OC EUKARYOTA; PLANTA; PHYCOPHYTA; RHODOPHYTA (RED ALGAE) , 

RN [1],„ 

RP SEQUENCE FROM N,A, 

RC STRAIN=AV0NPORT; 

RA REITH M.E,, MUNHOLLAND J,; 

RL t PLANT MOL. BIOL. REP, 13:333-335(1995) ,' 

CC •!- FUNCTION; COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
CC OF THE CHLOROPLASTIC RESPIRATORY CHAIN. 

CC •!- CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN - 
CC' PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 

CC -I- COFACTOR; TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN,' 

CC -!- SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B6-F ARE; CYTOCHROME B6, 
CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN. 

CC -!■ SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B, 

DR EMBL; U38804; G1276807; -. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 

KW ELECTRON TRANSPORT; HEME; CHLOROPLAST; PHOTOSYNTHESIS; TRANSMEMBRANE. 



FT 


METAL 


86 


86 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) , 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY), 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY) , 


FT 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


SQ 


SEQUENCE 


215 AA; 


24224 MW 


9F90E552 CRC32; 



_Query Match 5.4%; Score 104; DB 1; Length 215; 

"■ Best Local Similarity ■22.7%; Pred,'¥6, l,97e-01;' 
Matches 15; Conservative 20; Mismatches 25; Indels 6; Gaps 6; 

Db 57 YYRPTVAEAFTSVEYIMTDVNFGWLI-R-SI-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : ;: ;: I Ih: hi I : I :::: hhl h: :; I 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TW 118 

I ;| 

Qy 190 CSLGSW 195 



RESULT 14 

ID CYB6JYNP2 STANDARD; PRT; 222 AA. 

AC P28056; 

DT Ol-AUG-1992 (REL. 23, CREATED) 

DT Ol-AUG-1992 (REL, 23, LAST SEQUENCE UPDATE) 

DT Ol-JUL-1993 (REL, 26, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1.10.99.1). 

GN PETB. 

OS SYNECHOCOCCUS SP, (STRAIN PCC 7002) (AGMENELLUM QUADRUPLICATUM) . 

OC PROKARYOTA; GRACILICUTES; OXYPHOTOBACTERIA; 

OC CYANOBACTERIA (BLUE -GREEN ALGAE); CHROOCOCCALES . 

RN 11] 

RP SEQUENCE FROM N,A. 

.RX MEDLINE; 93043038. 

RA BRAND S.N., TAN X., WIDGER W.R.; 

RL PLANT MOL. BIOL, 20:481-491(1992). 

CC -!• FUNCTION: COMPONENT OF THE CYTOCHROME B6/F COMPLEX WHICH IS PART 
CC OF THE CYANOBACTERIAL RESPIRATORY CHAIN, 

CC -!- CATALYTIC ACTIVITY: PLASTOQUINOL-l + 2 OXYDIZED PLASTOCYANIN ■ 
CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN. 

CC -!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN. ' 

CC ' -I- SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B6-F ARE: CYTOCHROME B5, 

CC 17 KD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN. 

CC -!- SIMILARITY: WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B, 

DR EMBL; X63049; G38963; -. 

DR PIR; S18123; S18123. ' 

DR ■ PIR; S26193; S26193. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 

KW ELECTRON TRANSPORT; HEME; PHOTOSYNTHESIS; TRANSMEMBRANE, 

FT METAL 93 93' ■ IRON 1 (HEME AXIAL LIGAND) 

FT (BY SIMILARITY) , 



Best Available Copy i 
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FT 


METAL 


107 


107 


IRON 2 (HEME AXIAL LIGAHD) 


FT 








(BX.SIMILARITY). 


FT 


METAL 


194 


194 


IRON 2 {HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT 


METAL 


209 


209 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


SO 


SEQUENCE 


222 AA; 


25179 MW, 


7F5AB232 CRC32; 



Search completed; Fri Jan 22 18; CO; 58 1999 
Job .tinie,^,.-18 sees, 



Query Match 5,4»; Score 105; DB 1; Length 222; 

Best Local Similarity 21,2*; Pred. No. 1.45e-01; 

Matches 14; Conservative 21; Mismatches 25; Indels 6; Gaps 6; 

Db 64 YYKPTVAEAFTSVQYIMNEVNFGWLI-R-SI-HRWSASMMVLMMILHIFRVYLTGGFKRP 120 

I hlh |:| I ; I :;:: |:|:| ;:; ;: | : 

Oy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 



Db 121 REL-TW 125 
I :| 

Oy 190 CSWSW 195 



result' 15 

ID CYBe.CYAPA STANDARD; PRT; 215 AA, 

AC P48121; 

DT Ol-FEB-1996 (REL, 33, CREATED) 

DT Ql-FEB-1996 (REL, 33, LAST SEQUENCE UPDATE) 

DT Ol-FEB-1996 (REL, 33, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B6 (EC 1,10.99,1), 

GN PETB, 

OS CYANOPHORA PARADOXA, 

OG CYANELLE, 

OC EOKARYOTA; PLANTA; PHYCOPHYTA; GLAUCOPHYTA. 

RN [1] 

RP SEQUENCE FROM N, A, 

RC STRAIN-LB555 / PRINGSHEIM; 

RA STIREWALT V , L , , MICHALOWSKI C , B . , LUFFELHARDT W . , BOHNERT H , J , , 

RA BRYANT D, A.; 

RL SUBMITTED (JUL-1995) TO EMBL/GENBANK/DDBJ DATA BANKS, 

CC -I- FUNCTION; COMPONENT OF THE CYTOCHROME B5/F COMPLEX WHICH IS PART 

CC OF THE CYANELLE RESPIRATORY CHAIN, 

CC -!- CATALYTIC ACTIVITY: PLASTOQUINOL-1 + 2 OXYDIZED PLASTOCYANIN - 
CC PLASTOQUINONE + 2 REDUCED PLASTOCYANIN, 

CC -I- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 

CC ■ BOUND TO THE PROTEIN, 

CC -I- SUBONIT; THE MAIN SOBUNITS OF COMPLEX B6-F ARE; CYTOCHROME B6, 
CC 17 RD POLYPEPTIDE (PETD), CYTOCHROME F AND THE RIESKE PROTEIN, 

CC -!■ SIMILARITY; WITH THE AMINO END OF MITOCHONDRIAL CYTOCHROME B. 

DR EMBL; U30821; G1015119; -. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 

KW ELECTRON TRANSPORT; HEME; CYANELLE; PHOTOSYNTHESIS; TRANSMEMBRANE, 



FT 


METAL 


86 


86 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT 


METAL 


100 


100 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT 


METAL 


187 


187 


IRON 2 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


FT" 


METAL 


202 


202 


IRON 1 (HEME AXIAL LIGAND) 


FT 








(BY SIMILARITY). 


SQ 


SEQUENCE 


215 AA; 


24188 MW 


1042B361 CRC32; 



Qjiery Match 5,3»; Score 103; DB 1; Length 215; . 

, Best Local Similarity 23,7%; Pred. No. 2,66e-01; 
Matches 18; Conservative 21; Mismatches 31; Indels 6; Gaps 5; 

Db 57 YYRPTVTEAFASVQYIMTDVNFGWLI-R-SI-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I ; :: hlh; hi I I ;;;: |;|:| |;: " I 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWWGVILAVIT 128 

I :| Ihl 
Qy 190 CSLGSWARLDARAWT 205 
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24 99 


5.1 


215 


CBKL6P 


plastoquinol-'plastoc 


3.46e+00 


*********t**t****t**t******M**M**OUi*H**i**********W****M,***tt*.*i*A**.*.. - .. 


i 


-25-.— 99- 


5.1 


■ 247 


. G7100-9-— ' 


•hypothetical' protein-" 


•'3-:4'6e+a0 






26 98 


5,1 


351 


S5874S 


NADH dehydrogenase (u 


4.46e+00 


lT\ /Tl 1 1 1 _ 1 |"| fi 




27 99 


5,1 


363 


H69142 


perosamine synthetase 


3.46e+00 


i 1 \V /I 1 1 1 Mil 1 1 II 1 1 II II 




28 98 


5,1 


422 


A55417 


synaptotagmin II ■ mo 


4.46e+00 


V_\/ 1 1 1 I MM 1 1 M 1 1 II 11 




29 98 


5.1 


422 


BMRT2Y 


synaptotagmin II ■ ' ra 


4.46e+00 


II III 1 1 II III 1 1 




30 99 


5.1 


2206 


JC5280 


voltage^dependent cal 


3.46e+00 


1 II 1 l\ \ II 1 1 1 1 




31 97 


5.0 


87 


H70079 


hypothetical protein 


5.74e+00 


11 1 1 1 1 \ \ 1 1 1 1 1 1 

I 1 1 1 1 1 1 1 1 1 \ \ 1 1 II II 




32 97 


5.0 


172 


259457 


CDP"diacylglycerol " "g 


5.74e+00 


LI |_| |_| 1 1 |_| \_\ 1 1 |_| |_| (IM) 




33 97 


5.0 


598 


S75728 


hypothetical protein 


5.74e+00 






34 97 


5,0 


601 


A65129 


glutathione-regulated 


5,74e+00 


********************* t«««*«**«***t***«*t«t***«***i***i*******i*************it 




35 96 


4.9 


153 


B69434 


benzodiazepine recept 


7,37e+00 






36 95 


4,9 


162 


F71319 


hypothetical protein 


9,44e+00 


Release 3.1A John F. Collins, Biocomputing Research Unit. 




37 95 


4^9 


377 


C69758 


amino acid transporte 


9^446+00 


Copyright (c) 1993-1998 University of Edinburgh, (J.K, 




38 95 


4,9 


442 


S50332 


NADH dehydrogenase (u 


9.44e+00 


Distribution rights by Oxford Molecular Ltd 




39 96 


4,9 


683 


A41785 


system b(0,+) amino a 


7.37e+00 






40 95 


4,9 


1353 


JC4279 


adenylate cyclase (EC 


9.44e+00 


MPsrch_pp protein - protein database search, using Smith-Waterman algorithm 




41 94 


4,8 


239 


D71215 


hypothetical protein 


1.21e+01 






42 94 


4.8 


491 


S73844 


hypothetical protein 


1.21e+01 


Run on: Fri Jan 22 18; 00; 03 1999; MasPar time 11,44 Seconds 




43 94 


4,8 


578 


H65006 


hypothetical protein 


1.21e+01 


802.613 Million cell updates/sec , 




44 94 


4,8 


1415 


S52267 


DNA polymerase III • 


1.21e+01 


Tabular output not generated. 




45 94 


4.8 


2470 


S57085 


1-phosphatidylinosito 


L21e+01 



Title: 

Description: 
Perfect Score; 
Sequence: 

Scoring table; 
Searched: 



>US-09-162-597-3 

(1-245) from DS09162597.pep 

1940 

1 MTLFHFGNCFALAYFPYFIT GLVSQTLMYLFPASUJVLVK 245 

PAM 150 
Gap 11 

116738 seqs, 37463448 residues 



Post-processing: Hiniraom Match 0% 

Listing first 45 summaries 



Statistics ; 



pir58 

l;pirl 2;pir2 3:pir3 4:pir4 
Mean 45.371; Variance 103,911; scale 0.446 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



* 

Query 



SUMMARIES 



Match Length 


DB 


ID 


Description P 


red, No, 


6,1 


572 


2 


S73541 


phosphotransferase sy 


2.05e'02 


5,8 


215 


1 


CBZM6R 


plastoquinol--plastoc 


8,32e-02 


5.8 


215 


2 


S14961 


plastoquinol-plastoc 


8.32e'02 


5.8 


215 


1 


CBRZ6 


plastoquinol--plastoc 


8.32e-02 


5.8 


215 


1 


S09186 


plastoquinol-'plastoc 


8.32e'02 


5,8 


232 


1 


S04149 


piastoquinol'-plastoc 


8.32e-02 


5,8 


234 


2 


S58581 


plastoquinol--plastoc 


8,32e-02 


5,7 


215 


1 


CBNT6 


plastoquinol'-plastoc 


l,44e-01 


5.5 


167 


2 


S52970 


NADH dehydrogenase (u 


5.58e-01 


5.5 


215 


1 


S21253 


plastoquinol--plastoc 


4,27e-01 


5,5 


215 


1 


A30807 


plastoquinol-'plastoc 


5.58e-01 


5,5 


215 


1 


CBSP5 


plastoquinol'-plastoc 


5.58e-01 


5.4 


212 


2 


B69031 


conserved hypothetica 


7.286-01 


5.4 


215 


1 


CBLV6 


plastoquinol--plastoc 


7.286-01 


5.4 


215 


2 


S73262 


cytochrome b6 - red a 


9,48e-01 


5,4 


215 


2 


S78307 


plastoquinol-plastoc 


9.48e-01 


5.4 


222 


2 


S26193 


plastoquinol'-plastoc 


7,28e-01 


5.4 


310 


2 


D70139 


conserved hypothetica 


9,48e-01 


5.3 


222 


2 


S76297 


hypothetical protein 


1.236+00 


5.3 


222 


2 


S22470 


plastoquinol--plastoc 


1.23e+00 


5.3 


552 


2 


JC1173 


cycloheximide resista 


1.606+00 


5.2 


430 


2 


S15308 


hypothetical protein 


2.07e+00 


5.2 


570 


1 


C46238 


phosphotransferase sy 


2.076+00 



RESULT 
ENTRY 
TITLE 



1 



ORGANISM 
DATE 



ACCESSIONS 
REFERENCE 

lauthors 



tjournal 
ftitle 



S73541 , ttype complete 

phosphotransferase system enzyme I (EC 2,7,3.9) - Mycoplasma 
pneumoniae (SGC3) 

ALTERNATE NAMES phosphoenolpyruvate- -protein phosphotransferase; protein 
C12_orf572 
lformal_narae Mycoplasma pneumoniae 
27-Feb-1997 isequence_revision 25-Apr-1997 ttext change 

17-JU1-1998 
S73541 
S73327 

Himnelreich, R,; Hilbert, H.; Plagens, H,; Pirkl, E,; Li, 

B.C.; Herrmann, R. 
Nucleic Acids Res, (1996) 24:4420-4449 
Complete sequence analysis of the genome of the bacterium 
Mycoplasma pneumoniae, 
f cross-references moid; 97105885 
taccession S73541 

#tstatus nucleic acid sequence not shown; translation not shown 

ttmolecule.type DNA 

tiresidues 1-572 ttlabel HIM 

ttcross-references EMBL:AE000021; GB:U00089; NID:gl673871; PID:gl673880 
*fexperiraental_source ATCC 29342 

tfnote the nucleotide sequence was submitted to the EMBL Data 

t Library, November 1996 

GENETICS 

tgene ptsi 
tgenetic.code SGC3 

CLASSIFICATION tsuperfamily phosphotransferase system enzyme I; ' 

phosphotransferase system enzyme I homology 
KEYWORDS phosphotransferase; sugar transport system 

■FEATURE 

4-557 tdomain .phosphotransferase system enzyme I homology 

tlabel PTl 

SUMMARY #length 572 tmolecular-weight 53950 #checksum 7396 

Query Match 6,1*; Score 118; DB 2; Length 572; 

Best Local Similarity 28.8*; Pred, No. 2,05e-02; 

Matches 23; Conservative 26; Mismatches 25; Indels 5; Gaps 6; 

Db 446 SIGTN-DLIQYSFAADRMNKNVSY-LYQPLNPALLKLIYLTIEGGKVNDIWTGMCGEMAG 503 

II :l l;:l h ;: : I l|: : l|:| |;:|:: : I : ;|; ;:: 
Qy 136 SIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFLSVYKAFVMETFVHLCS-LGS 194 

Db 504 -EPL-AIPLLLGL-GLKEFS 520 

I I ::; II :ll h 
Qy 195 WARLDARAWTGLLALKHFG 214 



Best Available Copy 
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RESULT 2 

ENTRY CBZM6R Itype complete ' 

TITLE plastoquinol-'plastocyanin reductase (EC 1.10.99.1) 

cytochrome b5 - maize chloroplast 
ORGANISM tfornial_name chloroplast Zea mays llcoinmon_name maize 

DATE 31-Dec-1991 fsequence revision 31-Dec-1991 ttext change 

05-Sep-1997 
ACCESSIONS S08592 
REFERENCE S07171 

fauthors Rock, CD.; Barkan, A.; Taylor, W.C, 
.f journal Curr. Genet. (1987) 12:69-77 

Ititle The maize plastid psbB-psbF-petB-petD gene cluster; spliced 
and unspliced petB and petD RNAs encode alternative 
products , 
tcross-references MUID: 88210525 
taccession S08592 
stmolecule.type DNA 
ttresidues 1-215 Sflabel ROC 
ttcross-references EMBL;X05422; NID:gl2434; PlD:g311718 
GENETICS 

fgene petB 
fgenome chloroplast 
lintrons 2/3 
CLASSIFICATION tsuperfamily cytochrome b5; cytochrome b6 homology 
KEYWORDS chloroplast; chromoprotein; electron transfer; heme; iron; 
oxidoreductase; photosynthesis; thylakoid 

FEATURE 

15-215 Idomain cytochrome b6' homology tlabel CB6\ 

86,202 #binding_site heme iron (His) (axial Uganda) tstatus 

predicted\ 

100,187 fbinding_site heme iron (His) (axial ligands) tstatus 

predicted 

SUMMARY tlength 215 talecular-weight 24180 tchecksum 9478 

Query Match 5.8%; Score 113; DB 1; Length 215; 

Best Local Similarity 23.5*; Pred, No. 8.32e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6; 

Db 57 YYRPTVTEAFSSVOYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :l |;||:: I |:| I : I |:|:| |;; ;; I 

Qy , 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMEIFVHL 189 

Db 114 REL-TWVTGWLAVLTASFGVTGYS 137 

I ;l ::; :: 

Oy 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 3 

ENTRY S14961 stype complete 

TITLE plastoquinol-'plastocyanin reductase (EC 1.10.99.1) 

cytochrome b5 • wheat chloroplast 
ALTERNATEJAMES cytochrome b563 

ORGANISM lformal_name chloroplast: Triticum aestivum tcommon_narae 

comnion vheat 

DATE 08-Jun-1994 fsequence_revision 03-Aug-1995 ttext.change 

24-Apr-1998 
ACCESSIONS S14961 
REFERENCE S14960 

fauthors Hird, S.M.; Wilson, R.J.; Dyer, T.A.; Gray, J.C. 
tjournal Plant Mol. Biol. (1991) 16:745-747 
Ititle Nucleotide sequence of the wheat chloroplast petB and petD 
genes encoding apocytochrome b-563 and subunit IV of the 
cytochrome bf complex. 
Icross-references MUID:91329710 
taccession S14961 

♦tstatus preliminary 

ttmolecule.type DNA 

ttresidues 1-215 ttlabel HIR 

ttcross-references EMBL:X54751; NID;gl2351; PID:gl2363 ■ ,1 
GENETICS ■ , ; ■ 

^ tgene petB 



chloroplast 

lintrons 2/3 

CLASSIFICATION tsuperfamily cytochrome b6; cytochrome b6 homology 

KEYWORDS chloroplast; oxidoreductase; thylakoid 

FEATURE 

15-215 tdomain cytochrome b6 homology tlabel CB6 

SUMMARY tlength 215 Iraolecular-welght 24165 tchecksum 9436 



Query Match 5.84; 
Best Local Similarity 23.5%; 
Matches 20; Conservative 



Score 113; DB 2; Length 215; 

Pred. No. 8.32e-02; 

27; Mismatches 32; Indels 6; 



57 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFRKP 113 

I ; :| hll:; I 1:1 I : I |:|:| |:; :: I 

132 YIQMSIDSNISLVHYIVASAQV-WNITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

114 REL-TWVTGWLAVLTASFGVTGYS 137 

I ;| l|;|; :;: ;: 
190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 
ENTRY 
TITLE 



4 



CBRZ6 ttype complete 

plastoquinol--plastocyanin reductase (EC 1.10.99.1) 
cytochrome b6 - rice chloroplast 
ORGANISM tformaljame chloroplast Oryza sativa tcommon_name rice 
DATE 31-Mar-1990 tsequence revision 31-Mar-1990 ttext change 

18-Jul-1997 
ACCESSIONS JQ0256; S05136 
REFERENcF ""■JQ0200 " ■ ■ 

tauthors Shimada, H.; Whittier, R.F.; Hiratsuka, J.; Maeda, Y.; Hirai, 

A.; Sugiura, M. 
tsubmission submitted to JIPID, December 1989 
taccession JQ0256 
ttmolecule.type DNA 
ttresidues 1-215 Itlabel SHI 
ttexperiniental_source cv. Nihonbare 
REFERENCE S05080 

tauthors Hiratsuka, J.; Shimada, H.; whittier, R.; Ishibashi, T.; 

Sakamoto, M,; Mori, M.; Kondo, C; Honji, Y.; Sun, C.R.; 
Meng, B.Y.; Li, Y.Q.; Ranno, A.; Nishizawa, Y,; Hirai, A.; 
Shinozaki, K.; Sugiura, M. 
tjournal Mol. Gen. Genet. (1989) 217:185-194 
ttitle The complete sequence of the rice (Oryza sativa) chloroplast 
genome; intermolecular recombination between distinct tRNA 
genes accounts for a major plastid DNA inversion during the 
evolution of the cereals, 
tcross-references MUID:89364698 
taccession S05136 

tistatus nucleic acid sequence not shown; translation not shown 
ttmoleculejype DNA 
ttresidues 1-215 ttlabel HIR 
ttcross-references EMBL;X15901; NID:gll957; PID;g569082 
tiexperimental_source cv. Nihonbare 
COMMENT . This Cytochrome is one of the components of a specific ' '' • 'f 
■ . ■ stoichiometric cytochrome b6-f complex that contains two 
• molecules of , cytochrome b6, one cytochrome f and one nonheme 
iron-sulfur center. 

GENETICS 

tgene petB , 
tmap_position CP1232-71237, 72049-72690 
tgendme chloroplast 
tintrons 2/3 

CLASSIFICATION tsuperfamily cytochrome b6; cytochrome b6 homology 
,KE^;W0RDS chloroplast; chromoprotein; electron transfer; heme; iron; 

oxidoreductase; photosynthesis; thylakoid 

FEATURE 

15-215 ,' Idomain cytochrome b6 homology tlabel CB6\ 

86,202 " Ibindinglsite heme iron (His) (axial ligands) tstatus 

\ ' > predicted\ 

100,187 lbinding_site heme iron (His) (axial ligands) tstatus 

predicted • 

SUMMARY tlength 215. tmolecular-weight 24182 tchecksum 9304 
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Query Match 5.8%; Score 113 ^JB 1; Length 215; 

Best Local Similarity 23.5%; Pred. No. 8,32e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6 

Db 57 nRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :| hll:: I hll : i |;|;| |;: | 

Qy 132 nQMSIDSHISLVHnVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYRAFVMETFVHL 189 

Db 114 REL-TWVTGWLAVLTASFGVTGYS 137 

Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 5 

ENTRY S09186 Stype complete 

TITLE plastoquinol-plastocyanin reductase (EC 1,10.99.1) 

cytochrome b6 (form 2) ■ barley chloroplast 
ORGANISM fformal.name chloroplast Hordeum vulgare tcommonjanie barley 
DATE 07-Jun-1990 tsequence_revision 02-Aug-1994 ttext change 

05-Sep-1997 
ACCESSIONS S09186; JN0348 
REFERENCE S04100 

lauthors Reverdatto, S,V,; Andreeva, A.V.; Buryakova, A. A.; 

Chakhraakhcheva, O.G.; Efimov, V.A. 
fjournal Nucleic Acids Res. (1989) 17;2859-2860 
f title Nucleotide sequence of the 5,2 kbp barley chloroplast DNA 
fragment, containing psbB-psbH-petB-petD gene cluster, 
f cross -references MUID: 89240047 
Haccession S09186 
l*molecule_type DNA 
titresidues 1-215 Iflabel REV 

tScross-references EMBL:X14107; NID:gll593; PID;e274048; PID:gl617031 
GENETICS 

tgene petB 
Hgenome chloroplast 
tintrons 2/3 

CLASSIFICATION tsuperfaraily cytochrome b6; cytochrome b6 homology 
KEYWORDS alternative splicing; chloroplast; chroraoprotein; electron 

transfer; heme; iron; oxidoreductase; photosynthesis; 

thylakoid 

FEATURE 

15-215 tdoraain cytochrome b6 homology tlabel CB6\ 

85,187 tbinding.site heme iron, low potential (His) (axial 

Uganda) Istatus predicted\ 
100,202 tbinding_site heme iron, high potential (His) (axial 

ligands) Ustatus predicted 
SUMMARY tlength 215 tmolecular-weight 24166 Ichecksum 9436 



Query Match 5,8%; Score 113; DB 1; Length 215; 

Best Local Similarity 23,5*; Pred, No. 8.32e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6 

Db 57 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :: :l I hi I ; I :::: |:|:| |;: :: | 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWVTGWLAVLIASFGVTGYS 137 

I :| :;: :: 

Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT '6 

ENTRY S04149 ftype complete 

TITLE plastoquinol'-plastocyanin reductase (EC 1.10.99.1) 

cytochrome b6 (form 1) - barley chloroplast 
ORGANISM iformal_name chloroplast Hordeum vulgare tcommonjame barley 

DATE 07-Jun-1990 «sequence_revision 02-Aug-1994 Itextlchange 

05-Sep-1997 
ACCESSIONS S04149; JN0381 
REFERENCE S04100 

Hauthors Reverdatto, S.V,; Andreeva, A, v.; Buryakova, A. A.; 

Chakhmakhcheva, O.G.; Efimov, V.A, 



tjournal Nucleic Acids Res. (1989) 17;2859-2860 
. tt-itXe-..- • Nucleotide sequence-of-the 5,2 kbp-barley-chloroplast-DNA— 
fragment, containing psbB-psbH-petB-petD gene cluster, 
tcross -references MUID:89240047 
taccession S04149 
titmolecule type DNA 
tfresidues 1-232 ttlabel REV 
I icross -references EMBL;X14107; NID;gll593; PID:gll595 
GENETICS 

tgene petB 
tgenome chloroplast 
CLASSIFICATION tsuperfamily cytochrome b5; cytochrome b6 homology 
KEYWORDS alternative splicing; chloroplast; chroraoprotein; electron 

transfer; heme; iron; oxidoreductase; photosynthesis; 
thylakoid 

FEATURE 

32-232 fdomain cytochrome b5 homology f label CB6\ 

103,204 fbinding_site heme iron, low potential (His) (axial 

ligands) f status predicted\ 
117,219 fbinding.site heme iron, high potential (His) (axial 

- rigands ) ' Istatus" predicted 

SUMMARY llength 232 tmolecular-weight 26078 tchecksum 4328 

Query Match 5,8%'; Score 113; DB 1; Length 232; 

Best Local Similarity 23,5%; Pred, No. 8,32e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6; 

Db 74 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVIMMILHVFRVYLTGGFRKP 130 

I ; :: :l I |:| I : I |:|;| |;: :; I 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 131 REL-TWVTGWLAVLTASFGVTGYS 154 

Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



7 



ENTRY 
TITLE 



S58581 ttype complete 

plastoquinol'-plastocyanin reductase (EC 1.10,99.1) 
cytochrome b6 ■ maize chloroplast 
ORGANISM tformal_name chloroplast Zea mays tcommonjame maize 

DATE 29-N0V-1995 lsequence_revision 19-Jan-1996 Itext.change 

08-Sep-1997 
ACCESSIONS S58581 
REFERENCE S58531 

lauthors Maier, R,M,; Neckermann, K.; Igloi, G.L.; Koessel, H, 

Ijournal J. Mol, Biol. (1995) 251:614-628 

f title Complete sequence of the maize chloroplast genome: gene 

content, hotspots of divergence and fine tuning of genetic 
information by transcript editing, 
taccession S58581 
ttstatus nucleic acid sequence not shown; translation not shown 
itmoleculejype DNA 
ftresidues 1-234 ttlabel MAI 
Itcross-references EMBL;X86553; NID;g902200; PID:g902251 
ttnote the nucleotide sequence was submitted to the embl Data 

Library, April 1995 

GENETICS 

tgene petB 
tgenome chloroplast 
tintrons 2/3 

CLASSIFICATION tsuperfamily cytochrome b6; cytochrome b6 homology 
KEYWORDS chloroplast; electron transfer; heme; membrane-associated' 

complex; oxidoreductase; photosynthesis; thylakoid 

FEATURE 

34-234 tdomain cytochrome b6 homology tlabel CB6 

SUMMARY tlength 234 tmolecular-weight 26182 tchecksum 5835 

Query Match ■ 5.8%; Score 113; DB 2; Length 234; 

Best Local Similarity 23.5%; Pred. No. 8.32e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6; 

Db 76 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKRP 132 



Best Available Copy 

Sun Jan 24 11:29:02 1999 
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I : :; :| IMh; I |:M ; I |;|;| h: ;: I 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189, 

Db 133 REL-TWVTGWLAVLTASFGVTGYS 156 

I :| 11:1: ::: :: 
Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 8 

ENTRY CBNT6 Stype complete 

TITLE plastoquinol--plastocyanin reductase (EC 1.10.99.1) 

cytochrome b6 - coiranon tobacco chloroplast 
ALTERNAIEJAMES Cytochrome b563 

ORGANISM #formal_naine chloroplast Nicotiana tabacum «common_narae 

common tobacco 

DATE 30-Jun-1987 tsequence.revision 30-Jun-1987 ftext change 

04-Oct-1996 
A00162 
A00149 

tauthors Sugiura, M, 

tsubraission submitted to the EMBL Data Library, August 1986 
taccession A00162 
tSmolecule.type dha 
tfresidues 1-215 Sflabel SUG 
*texperimental_source cv. Bright Yellow 4 
A38013 

Shinozaki, K.; Ohme, M.; Tanaka, M.; Wakasugi, T,; l 
N.; Matsubayashi, T.; Zaita, N,; Chunwongse, J.; Obokata, 
J.; Yaraaguchi-Shinozaki, K.; Ohto, C; Torazawa, K,; Meng, 
B.Y.; Sugita, M.; Deno, H.; Kamogashira, T,; Yamada, K,; 
Kusuda, J.; Takaiwa, F.; Kato, A.; Tohdoh, N,; Shimada, H.; 
Sugiura, M, 
EMBO J. (1986) 5:2043-2049 

The complete nucleotide sequence of the tobacco chloroplast 

genome; its gene organization and expression, 
annotation; gene organization, sites, features 
This cytochrome is one of the components of a specific 
stoichiometric cytochrome b6-f complex that contains two 
molecules of cytochrome b6, one cytochrome f, and one nonheme 
iron-sulfur center. 

GENETICS 

tgene petB 

chloroplast 
2/3 

itsuper family cytochrome b6; cytochrome b6 homology 
chloroplast; chromoprotein; electron transfer; heme; iron; 
oxidoreductase; photosynthesis; thylakoid 

idoraain cytochrome b6 homology flabel CB6\ 
tbindingjite heme iron (His) (axial Uganda) tstatus 
predicted\ 

lbinding_site heme iron (His) (axial ligands) tstatus,' 
predicted 

SUMMARY llength 215 fmolecular-weight 24136 tchecksum 9633 

Query Match' 5,7%; Score 111; DB 1; Length 215; 

Best Local Similarity 22.4%; ,Pred. No. 1.44e-01; 

Matches 19; Conservative 28; Mismatches 32; Indels 6; Gaps 6; 

Db 57 YYRPTVTEAFASVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :: :: |:||;: I |:| I : I :;:;|;|;| |;: :: | 
Qy 132 YIQMSIDSNISLVHYIVASAOV-WMITRYDLYHHFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWVTGWLAVLTASFGVTGYS 137 

I :| 11:1: ::: ;: 
Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



tauthors 



tjournal 
ttitle 



tcontents 

COMMENT 



tintrons 
CLASSIFICATION 
KEYWORDS 

FEATURE 
15-215 
86,202 

100,187 



RESULT 
ENTRY 
TITLE 

vORGANISM 



S52970 Itype complete 

NADH dehydrogenase (ubiquinone) (EC 1,6,5.3) chain 6 • 

honeybee mitochondrion (SGC4) 
Iformaljame mitochondrion Apis mellifera tcommon.name 



ACCESSIONS 



tauthors 
tjournal 
ttitle 

taccession 



DATE „ 15-JU1-1995 *sequence_r.evision 10-Novl995 ttext change 
21-Aug-1998 
S52970 
S52960 

Crozier, R.H.; Crozier, Y,C. 
Genetics (1993) 133:97-117 

The mitochondrial genome of the honeybee Apis mellifera: 

complete sequence and genome organization. 
S52970 
ttmoleculejype DNA 
■ ttresidues 1-167 ttlabel CRO 

I tcross -references EMBL:L06178; NID:g336279; PID;g829009 
tfexperimental.source ligustica 
GENETICS 

tgenome mitochondrion 
tgenetic_code SGC4 
tstart_codon ATT 

CLASSIFICATION tsuperfamily NADH dehydrogenase (ubiquinone) chain 6 
KEYWORDS membrane-associated complex; mitochondrion; NAD; 

oxidoreductase 

SUMMARY tlength 167 Kmolecular-weight 19971 tchecksum 6545 

Query Match 5.5*; Score 106; DB 2; Length 167; 

Best Local Similarity 29.9*; Pred. No. 5.58e-01; 

Matches 20; Conservative 17; Mismatches 25; Indels 5; Gaps ! 

Db 11 IFMSSLISMILTIY-LNNIFNSPSMLLIYLISYSIYMSLMMFT-MCSMNSLLILMILIVF 68 

I I: : II I I I ; I :: :||;;| | :; ; | :||: | | | 
Qy 147 IVASAQVWMI-TRYDLYHNFRPAVLLLMFLSVYKAFV-METFVHLCSLGSWARLDARAV- 203 

Db 69 LSGMLIM 75 

:;|:| : 
Qy 204 VTGLLAL 210 



RESULT 
ENTRY 
TITLE 



10 



ORGANISM 
DATE 



ACCESSIONS 



tauthors 
t journal 
.ttitle- 



S21253 ttype complete 

plastoquinol--plastocyanin reductase (EC 1,10.99,1) 

cytochrome b6 - Chlamydomonas reinhardtii chloroplast 
tformaljarae chloroplast Chlamydomonas reinhardtii 
03-Feb-1994 tsequence_revision 02-Aug-1994 ttext change 

05-Sep-1997 
S21253; S16917 
S20938 

Huang, C; Liu, X.Q. 
Plant Mol. Biol, (1992) 18:985-988 
Nucleotide sequence of the frxC, petB and trnL genes in the 
chloroplast genome of Chlamydomonas reinhardtii , 
tcross-references MUID; 92256821 
taccession S21253 

' ttstatus translation not shown 
• ttmolecule_type DNA 
■ ttresidues 1-215 ttlabel HUA 
ttcross-references EMBL:X62905; NID:gl2497; PID:gl2499 
S16916 , 

Bueschlen, S,; Choquet, Y,; Kuras, R.; Wollman, F,A, 
FEBS Lett, (1991) 284:257-262 

Nucleotide sequences of the continuous and separated petA, 
petB and petD chloroplast genes in Chlamydomonas 
, ■ reinhardtii. 

tcross-references MUID: 91285146 
taccession S16917 
ttmolecule.type DNA 
ttresidues 1-215 ttlabel BDE 
ttcross-references EMBL;X72918; NID:g503530; PID:g288909 
(3ENETICS , , . 

tgene , petB 

'chloroplast 

tsuperfamily cytochrome b6; cytochrome b6 homology 
chloroplast; chromoprotein; electron transfer; heme; iron; 
oxidoreductase; photosynthesis; thylakoid 



, . tauthors 
' tjournal 
, ttitle 



(, I tgenome 

CLASSIFICATION 

KEYWORDS 



FEATURE 



!' 



Sun Jan 24 11:29:02 1999 
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15-215 
86,187 



100,202 



tdomain cytochrome b6 homology t label CB6\ 
»binding_sitG heme iron, low potential (His) (axial,,. 

ligands) istatus predicted\ 
tbinding_site heme iron, high potential (His) (axial 

ligands) f status predicted 
tlength 215 fmolecular-weight 24164 Itchecksum 8607 



Query Match 5.54; Score 107; DB 1; Length 215; 

Best Local Similarity 22.7%; Pred. No. 4.27e-01; 

Matches 15; Conservative 21; Mismatches 24; Indels 6; Gaps 6 

Db 57 YYRPTVAEAFASVQyiMTDVNFGWLI-R-SI-HRWSASMMVLMMVLHVFRVYLTGGFKRP 113 

I ; ;; :; hlh: |:| I ; I |:| | |;; | : 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WHITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TW 118 

I :| 

Qy 190 CSLGSW 195 



RESULT 
ENTRY. 
TITLE 



11 



ORGANISM 
DATE 



ACCESSIONS 



Hauthors 
tjournal 
»title 



A30807 ftype complete 

plastoquinol-plastocyanin reductase (EC 1.10.99.1) 

cytochrome b6 ■ Nostoc sp. 
tfornial_name Nostoc sp. 

Ol-Dec-1989 Usequence.revision 02-Aug-1994 #text change 

04-Oct-1996 
A308O7 
A94683 

Kallas, T.; Spiller, S.; Malkin, R. 
J. Biol. Chem. (1988) 263:14334-14342 
Characterization of two operons encoding the cytochrome b-6-f 
complex of the cyanobacterium Nostoc PCC 7906. Highly 
conserved sequences but different gene organization than in 
chloroplasts . 
#cross -references MUID;B9008280 
Saccession A30807 
Slmoleculejype DBA 
tiresidues 1-215 tflabel KAL 
ftcross -references GB;J02961 
ftexperimental_source PCC 7906 
CLASSIFICATION fsuperfamily cytochrome b6; cytochrome b6 homology 

chromoprotein; electron transfer; heme; iron; oxidoreductase; 
photosynthesis; thylakoid 



KEYWORDS 



FEATURE 
15-215 
86,187 



tdomain cytochrome b6 homology f label CB6\ 
tbinding_site heme iron, low potential (His) (axial 
ligands) f status predicted\ 
100,202 fbindingjite heme iron, high potential (His) (axial 

ligands) f status predicted 
SUMMARY llength 215 fmolecular-weight 24259 #checksum 8311 

Query Match 5,5»; Score 106; DB 1; Length 215; 

Best Local Similarity 25.04; Pred. No. 5.58e-01; 

Matches 19; Conservative 18; Mismatches 33; Indels 6; Gaps 6; 

Db , 57 YYRPTVAEAFSSVEYIMNEVNFGWLI-R-SI-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I :: :| I Ih |:| I : I :::: |:|:| |;: ;: | 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWVSGVILAVIT 128 

Qy 190 CSLGSWARLDARAWT 205 



ENTRY CBSP6 ftype complete 

TITLE plastoquinol--plastocyanin reductase (EC 1.10,99.1) 

cytochrome b6 - spinach chloroplast 
ORGANISM fformaljame chloroplast Spinacia oleracea Icoramonjarae 

spinach 

DATE 03-Aug-1984 ftsequence_revision 02-Aug-1994 »text change 

17-Apr-1998 



S03021; A00161; S00429 

S03021 

Westhoff, P.; Herrmann, R.G, 
Eur, J. Biochem, (1988) 171:551-564 
Complex RNA maturation in chloroplasts , 
Icross -references MUID;88151952 
taccession S03021 
tiraolecule.type DNA 
firesidues 1-22 titlabel WES 
Itnote 



tauthors 
tjournal 
ititle 



ttnote 



tauthors 
tcitation 



this sequence is a revision to the sequence from 

reference S00429 
not compared with conceptual translation 
A00161 

Widger, W.R.; Cramer, W.A.; Herrmann, R.G.; Trebst, A. 
unpublished results, 1984, cited by Widger, W.R., Cramer, 
W.A., Herrmann, R.G., Trebst, A., Proc, Natl. Acad. Sci. 
0,S,A. 81, 674-678, 1984 
tcross -references MUID:84144766 

,J.acc.es5ion„....A00161 „ ., 

ttstatus nucleic acid sequence not shown 
ttmolecule_type DNA 

'MIGSKNVSRFRRLRMI', 21-215 tflabel WID 
S00429 

Heinemeyer, W.; Alt, J.; Herrmann, R.G. 
Curr. Genet. (1984) 8:543-549 

Nucleotide sequence of the clustered genes for apocytochrome- 
b6 and subunit 4 of the cytochrome b/f complex in the 
spinach plastid chromosome. 
S00429 
ttmoleculejype DNA 
■ tiresidues 'MIGSKNVSRFRRLRMI', 21-215 tflabel HEI 
I tcross -references EMBL;X07106 

ttnote this sequence has been revised in reference S03021 

COMMENT This Cytochrome is the chloroplast counterpart of the mitochondrial 
cytochrome b. 

GENETICS 



tauthors 
tjournal 
ttitle 



taccession 



tgenome 
tintrons 
CLASSIFICATION 



FEATURE 
15-215 
86,187 



petB 

chloroplast 
2/3 

fsuperfamily cytochrome b6; cytochrome b6 homology 
chloroplast; chromoprotein; electron transfer; heme; iron; 
oxidoreductase; photosynthesis; thylakoid 

tdomain cytochrome b6 homology tlabel CB6\ 
fbinding_site heme iron, low potential (His) (axial 

ligands) Istatus predicted\ 
Jbinding_site heme iron, high potential (His) (axial 

ligands) tstatus predicted 
tlength 215 tmolecular -weight 24166 tchecksum 9582 



Query Match 5.54; Score 106; DB 1; Length 215; 

Best Local Similarity 22.74; Pred. No. 5.58e-01; 

Matches 15; Conservative 21; Mismatches 24; Indels 5; Gaps 6; 

Db • 57 YYRPTVTDAFASVQYIMTEVNFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 1131 • 

!■ : :; ;: |:||:: hll : I |:|:| |:: ;: I 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-IW 118 
I I :| 

Qy 190 CSLGSW 195 



RESULT 13 

ENTRY 

TITLE 

ORGANISM 
DATE 



B69031 ftype complete 

conserved hypothetical protein MTH1229 - Methanobacterium 

thermoautotrophicum (strain Delta H) 
IformaLname Methanobacterium thermoautotrophicum 
05-Dec-1997 lsequence_revision 05-Dec-1997 ttext change 

13-Sep-1998 
B59031 
A59000 

Smith, D.R.; Doucette-Stamm, L.A.; Deloughery, C; Lee, H.; 



Best Available Copy i , 
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Dubois, J,; Aldredge, T.; Bashirzadeh, R.; Blakely, D,; 
Cook, R.; Gilbert, K,; Harrison, D,; Hoang, L.; Keagle,„.P,; 
Luniiii, W,; Pothier, B.; Qiu, D.; Spadafora, R.; Vicaire,"R.; 
Wang, Y.; wierzbowski, J.; Gibson, R,; Jiwani, N.; Caruso, 



A.; Bush, D.; Safer, H,; Patwell, 
McDougall, S,; Shinier, G.; Goyal, 



; Prabhakar, S.; 
; Pietrokovski, S,; 

; Noelling, 



tjournal 
ttitle 



Church, G.M.; Daniels, C.J,; Mao, J.; Rice, 
J.; Reeve, J.N. 
J. Bacteriol, (1997) 179:7135-7155 
Complete geDome sequence of Methanobacterium 
thermoautotrophicum Delta H; functional analysis and 
comparative genomics, 
'tcross-references MUID:98037514 
faccession B69031 

titstatus preliminary; nucleic acid sequence not shown; 

translation not shown 
*toolecule_type DNA 
titresidues 1-212 tllabel MTH 

tScross-references GB;AE000890; GB;AE000666; NID:g2622331; PID:g2522340 
tiexperimentaLsource strain Delta H 
GENETICS 

M1H1229 

fsuperfamily Methanococcus jannaschii conserved hypothetical 

protein MJ0805 
tlength 212 tmolecular -weight 23956 tchecksum 2161 



fgene 
CLASSIFICATION 



SDMMARY 



Query Match 5,4*; Score 105; DB 2; Length 212; 

Best Local Similarity 31,5%; Pred. No. 7.28e-01; 

Matches 23; Conservative 17; Mismatches 27; Indels 6; Gaps E 

Db 44 AWALSELGIKGSSIITWIGPMDVKNIVDYEDVELEIRSEEVL-HFIVEHFDE-QPSSLRL 101 
:|| :|l I:: I hi: : :: I ;:: I |: 1 |:|| ■ ; : h . 
Qy 105 GWATAEL-IMSRCIPLWVGARGI-EF-DWKYIQMSIDSNISLVHYIVASAQVWMITRYDL 161 

Db 102 AYHRQRILVMLLM 114 

II I hill 
Qy 162 -YHNFRPAVLLLM 173 



RESULT 
ENTRY 
TITLE 



14 



ORGANISM 
DATE 



ACCESSIONS 



#authors 



tjournal 
«title 



CBLV6 ftype complete 

plastoquinol--plastocyanin reductase (EC 1.10.99.1) 
cytochrome b6 - liverwort (Marchantia polporpha) 
chloroplast 

tformaljarae chloroplast Marchantia polporpha 
30-Jun-1987 tsequence_revision 30-Jun-1987 *text change 

04-Oct-1996 
S01552; S02432; A00163 
S01529 

Fukuzawa, H.; Kohchi, T.; Sanb, T.; Shirai, H.; Cmesono, K.; 

Inokuchi, H.; Ozeki, H.; Ohyama, K, 
J. Mol, Biol. (1988) 203:333-351 
Structure and organization of Marchantia polymorpha 
chloroplast genome. Ill, Gene organization of the large 
single copy region from rbcL to trnl(CAO). 
tcross-references MniD:89068687 
taccession S01552 

ttmoleculejype DNA j 
ftresidues 1-215 ftlabel FUK 
ttcross -references EMBL:X'04465 
A38014 

Ohyama, K,; Fukuzawa, H,; Kohchi, T.; Shirai, H,; Sano, T.; 
Sano, S,; Dmesono, K,; Shiki, Y,; Takeuchi, M,; Chang, Z.;. 
Aota, S,; Inokuchi, H.; Ozeki, H, 
Nature (1986) 322:572-574 

Chloroplast gene organization deduced from complete sequence 

of liverwort Marchantia polymorpha chloroplast DNA, 
annotation; gene organization, sites, features 
S02432 

Fukuzawa, H,; Yoshida, T,; Kohchi, T,; Okumura, T,; Sawano, 

Y,; Ohyama, K. ' 
FEBS Lett. (1987) 220:61-65 

Splicing of group II introns in rnRNAs coding for cytochrome 



tauthors 



tjournal 
ttitle 



tcontents 



tauthors 



tjournal 
ttitle 



b5 and subunit IV in the liverwort Marchantia polymorpha 
chloroplast genome. ,Exon specifying a region coding Jor two 
genes with the spacer region, 
taccession S02432 

tistatus not compared with conceptual translation 
ttniolecule_type mRNA 
ttresidues 1-8 ttlabel FOW 
GENETICS 



tgenorae 
,t introns 

CLASSIFICATION 

KEYWORDS 

FEATURE 
' 15-215 
86,187 



chloroplast 
2/3 

tsuperfamily cytochrome b5; cytochrome b6 homology 
chloroplast; chromoprotein; electron transfer; heme; iron; 
oxidoreductase; photosynthesis; thylakoid 

tdomain cytochrome b6 homology llabel CB6\ 
tbinding_site heme iron, low potential (His) (axial 
ligands) tstatus predicted\ 
100,202 tbinding_site heme iron, high potential (His) (axial 

ligands) tstatus predicted 
SUMMARY tlength 215 traolecular-weight 24307 tchecksum 8453 

Query Match 5.4%; Score 105; DB 1; Length 215; 

Best Local Similarity 22. 7»; Pred, No, 7, 28e- 01; 

Matches 15; Conservative 21; Mismatches 24; Indels 6; Gaps £ 

Db 57 YYRPTVTEAFSSVQYIMTEVNFGWLI-R-SV-HRWSASMMVLMMILHIFRVYLTGGFKKP 113 

I : ;: :| hlh; hi I : I :;:; hhl :;: :: | 
Qy 132 YIQMSIDSHISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TW 118 

I :| 

Qy 190 CSLGSW 195 



RESULT 
ENTRY 
TITLE 
ORGANISM 
DATE 



15 



S73252 ttype complete 

cytochrome b6 • red alga (Porphyra purpurea) chloroplast 
tformal_name chloroplast Porphyra purpurea 
19-Mar-1997 tsequence.revision 09-Mayl997 ttext change 

24-Apr-1998 
S73262 
S73108 

Reith, M,; Munholland, J, 
Plant Mol. Biol. Rep. (1995) 13:333-335 
Complete nucleotide sequence of the Porphyra purpurea 

chloroplast genome. 
S73262 

preliminary; nucleic acid sequence not shown; 
translation not shown 
ttmoleculejype DNA 
ttresidues 1-215 ttlabel REI 
ttcross-references EMBL:U38804; NID;gl275652; PID;gl276807 



tauthors 
tjournal 
ttitle . 

taccession 
ttstatus 



ttnote 
GENETICS 

tgene 
tgenome 
CLASSIFICATION 



15-215 
SUMMARY ■ 



the nucleotide sequence was submitted to the EMBL Data 
■Library, October 1995 

petB 

chloroplast 

tsuperfamily cytochrome b5; cytochrome b6 homology 
chloroplast; thylakoid 

tdomain cytochrome b5 homology tlabel CB6 
tlength 215 tmolecular-weight 24224 tchecksum 9699 



'Query Match 5.4%; 
Best Local Similarity 22.7%; 
Matches 15; Conservative 



Score 104; DB 2; Length 215; 

Pred, No. 9,48e-01; 

20; Mismatches 25; Indels 6; 



Db 57 YYRPTVAEAFTSVEYIMTDVNFGWLI-R-SI-HRWSASMMVLMMILHVFRVYLTGGFKKP 113 

I : :: I Ih; hi I : I ■ hhl h; :: | 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TW 118 



Best Available Copy 

Sjiir Jan 24 11:29:02 1999 



Qy 190 CSLGSW 195 

Search completed; Fri Jan 22 18:00:22 1999 
Job time : 19 sees. 
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Release 3.1A John F. Collins, Biocomputing Research Dnit, 
Copyright (c) 1993-1998 University of Edinburgh, U.K. 
Distribution rights by Oxford Molecular Ltd 

MPsrchjp protein - protein database search, using Smith-Waterman algorithm 

Run on; Fri Jan 22 18:03:071999;'' Masfar~time~"3r5rseronds 

597.255 Million cell updates/sec 

Tabular output not generated. 



Title: 

Description: 
Perfett Score: 
Sequence: 

.Scoring table: 



>US-09-162-597-3 

(1-245) from US09162597.pep 

1940 

1 MTLFHFGNCFALAYFPIFIT GLVSQTMYLFPASLQVLVK 245 

PAM 150 
Gap 11 



Searched: 92929 seqs, 8738560 residues 

Post-processing; Minimum Match 0* 

Listing first 45 summaries 

Database: a- issued 

1:5_C0MB 2:PCT9_COMB 3:bacl(fiiesl 

Statistics: Mean 31.165; Variance 148.701; scale 0.210 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution, 



SUMMARIES 



Query 



No. Score 


Match Length D 


3 ID 


Description 


Pred, No. 


1 94 


4.8 772 


PCT-DS95-0 


Sequence 2, Applicatio 


l,66e+01 


2 94 


4.8 772 


DS-08-258- 


Sequence 2, Applicatio 


1.66e+01 


3 89 


4.6 349 


US-08-118- 


Sequence 71, Applicati 


3.57e+01 


4 89 

5 87 


4.6 349 


PCT-DS93-0 


Sequence 71, Applicati 


3.57e+01 


4.5 307 


OS-08-948- 


Sequence 3, Applicatio 


4,82e+01 


• 5 87 


4.5 417 


DS-08-553- 


Sequence 2, Applicatio 


4.82e+01 


7 85 


4.4 201 


5489533-3 


Patent No. 5489533. 


6.50e+01 


8 85 


4.4 252 


PCT-DS95-1 


Sequence 103, Applicat 


6.50e+01 


9 85 


4.4 274 


5489533-2 


Patent No, 5489533. 


6.50e+01 


10 85 


4.4 274 


5512660-2 


Patent No, 5512660. 


6.50e+01 


' 11 84 


4.3 316 


OS-08-118- 


Sequence 46, Applicati 


7.54e+01 


12 84 


4,3 316 


PCT-OS93-0 


Sequence 46, Applicati 


7.54e+01 


13 83 


4.3 345 


PCT-OS96-1 


Sequence 2, Applicatio 


8.75e+01 


14 83 


4.3 363 


OS-08-148- 


Sequence 2, Applicatio 


8.75e+01 


15 83 


4,3 353 


aS-08-148- 


Sequence 3, Applicatio 


8.75e+01 


16 83 


4,3 380 


PCT-DS93-1 


Sequence 40, Applicati 


8.75e+01 


17 83 


4,3 380 


OS-08-153- 


Sequence 40, Applicati 


8.75e+01 


18 84 


4,3 494 


PCT-OS94-0 


Sequence 4, Applicatio 


7.54e+01 


19 84 


4.3 494 


CS-08-454- 


Sequence 4, Applicatio 


7,54e+01 


20 84 


4,3 2368 


US-08-198- 


Sequence 15, Applicati 


7,54e+01 


21 82 


4.2 225 


5436139-4 


Patent No, 5436139. 


l,01e+02 


22 82 


4,2 226 


5436139-5 


Patent No. 5436139. 


l,01e+02 


23 82 


4.2 226 


5196194-21 


Patent No, 5196194. 


l,01e+02 



24 


82 


4,2 


226 


PCT-US96-1 




1 


Ole+02 


25 


-,^..82 


4,2 


226 


5198343.7,1 


Patent No. 5198348. 


-i. 


,Oi^,M..~ 


26 


82 


4.2 


226 


UO vO J/0 




1 


Ole+02 


27 


82 


4.2 


235 


[jg.()8.3]8- 


Sst^usncs 1/ Applicatio 


1 


Ole+02 


28 


82 


4,2 


277 


5164485-2 


Paf-pnt- Nn MfiAAftR 

raLcllL HU. JiU't40J. 


I 


Ole+02 


29 


82 


4,2 


279 


L us u/ DOO 


S6(5U6nc6 30f Applicati 






30 


82 


4,2 


279 


PCT-tJS91-0 


Ss^usnc6 29/ Applicati 


1 


Ole+02 


31 


, 82 


4,2 


281 


FK-fifl-iSfl- 

\J0 uO 'iJQ 




1 


Ole+02 


32 


82 


4.2 


281 


US-08-105- 


Ss^usncs 214/ Applicat 


1 


Ole+02 


33 


82 


4.2 


351 




SGcju6nc6 4| Applicatio 


1 


Ole+02 


34 


82 


4.2 


361 


PCT-US93-0 


S6cjusnc6 4/ Applicatio 


1 


Ole+02 


35 


82 


4.2 


389 


re. no. 7(10. 

UD UO f\JJ 


Se^u6nc6 216/ Applicat 


1 


Ole+02 


36 


82 


4,2 


389 


US-08-709- 


S6cjusnc6 219/ Applicat 


1 


Ole+02 


37 


82 


4^2 


389 


L US-08-105- 


Sequence 219, Applicat 


1 


Ole+02 


38 


82 


4.2 


389 


l US-08-458- 


Sequence 216, Applicat 


1 


Ole+02 


39 


82 


4.2 


389 


L US-08-458- 


Sequence 219, Applicat 


1 


Ole+02 


40 


82 


4.2 


389 


L US-08-105- 


Sequence 216, Applicat 


1 


Ole+02 


41 


82 


4.2 


395 


5196194-18 


Patent No. 5196194, 


1 


Ole+02 


42 


82 


4.2 


397 


I PCT-aS95-l 


Sequence. 6, Applicatio 


1 


Ole+02 


43' 


"sr 


■■4.2 


502-" 


L' US-08-T96- 


Sequence 4, Applicatio 


1 


Ole+02 


44 


02 


4.2 


502 


L US-08-466- 


Sequence 10, Applicati 


1 


Ole+02 


45 


82 


4.2 


995 


2 PCT-US95-0 


Sequence 14, Applicati 


1 


Ole+02 



RESULT 1 

ID PCT-US95-07391A-2 STANDARD; PRT; 772 AA, 

XX ■ ' 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 2, Application PC/TOS9507391A 
XX 

CC Sequence 2, Application PC/TOS9507391A 

CC GENERAL INFORMATION; 

CC APPLICANT; IBEX TECHNOLOGIES and 

CC APPLICANT; ZIMMERMANN, Joseph 

CC TITLE OF INVENTION; Nucleic Acid Sequences And Expression 

CC TITLE OF INVENTION; Systems For Heparinase II And Heparinase III Derived Fr 

CC TITLE OF INVENTION; Flavobacterium heparinum 

CC NUMBER OF SEQUENCES: 26 

CC CORRESPONDENCE ADDRESS; 

CC ADDRESSEE: Hale and Dorr 

CC STREET: 1455 Pennsylvania Avenue, N.W. 

CC CITlf; Washington, D.C. 

CC COUNTRY: U.S. A, 

CC ZIP: 20004 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentin Release »1.0, Version tl.25 

CC CURRENT APPLICATION DATA: I- 

CC APPLICATION NUMBER: PCT/US95/07391A 

CC FILING DATE: 09-JUNE-1995 

CC CLASSIFICATION: 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: 08/258,639 

CC FILING DATE: 10 JUNE 1994 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: BAKER, Hollie l: 

CC REGISTRATION NUMBER: 31,321 

CC REFERENCE/DOCKET NUMBER: 104385, 116PCT 

CC TELECOMMUNICATION INFORMATION: 

CC ' TELEPHONE; (202)942-8400 

. CC TELEFAX: (202)942-8484 

CC INFORMATION FOR SEQ ID NO: 2: 

CC SEQUENCE CHARACTERISTICS; 

CC 1 LENGTH; 772 amino acids 

CC TYPE; amino acid 

CC TOPOLOGY; linear 
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CC MOLECULE TYPE: protein 

SQ SEQOENCE 772 AA; 87638 MW; 3109034 CN; 

Query Match 4.8%; Score 94; DB 2; Length 772; 

Best Local Similarity 26.1%; Pred. No. 1.66e+01; 

Matches 23; Conservative 25; Mismatches 35; Indels 5; Gaps 3; 

Db 94 DFRFYFNQKGLTVRVELMALNYLMTKDPKVGREAITSIIDTLETATFKPAGDISRGIGLF 153 

II I ;: I :|:;|l :|:;:; l; :: :l II : :|| I I 
Qy 66 DFIGEFMKASVDV-ADLIGLNLVMSRNA--GKGEYKIMVAALGWAT--AELIMSRCIPLW 120 

Db 154 MVTGAIVYDWCYDQLKPEEKTRFVKAFV 181 

; : :| III I I: : : :| :| 
Qy 121 VGARGIEFDWKYIQMSIDSNISLVHYIV 148 



RESULT 2 

ID US-08-258-639A-2 STANDARD; PRT; 772 AA. 
XX 

AC xxxxxx 

XX . 
DT 
XX 

DE Sequence 2, Application US/08258639A 

XX ' 

CC Sequence 2, Application US/08258639A 

CC Patent No. 5681733 

CC GENERAL INFORMATION: 

CC APPLICANT: Su, Hongsheng 

CC APPLICANT: Slain, Francoise 

CC APPLICANT: Bennett, Clark 

CC APPLICANT: Gu, Kangfu 

CC APPLICANT: Zinmerinann, Joseph 

CC APPLICANT: Musil, Roy 

CC TITLE OF INVENTION: Nucleic Acid Sequences And Expression 

CC TITLE OF INVENTION: Systems For Heparinase II And Heparinase III Derived 

CC TITLE OF INVENTION: Flavobacterium heparinui 

CC NUMBER OF SEQUENCES: 25 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Hale and Dorr 

CC STREET: 1455 Pennsylvania Avenue, N.W, 

CC CITY: Washington, D.C. 

CC COUNTRY: U.S.A. 

CC ZIP; 20004 

CC ■ COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentin Release 11. 0, Version tl,25 

CC • CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/258, 639A 

CC FILING DATE: 10 JUNE 1994 ■ " 

CC CLASSIFICATION: 435 . . . ■ 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Healey, William J. 

CC REGISTRATION NUMBER: 36,160 

CC REFERENCE/DOCKET NUMBER: 104385.115 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (202)942-8400 

CC TELEFAX: (202)942-8484 

CC INFORMATION FOR SEQ ID NO: 2: 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 772 amino acids 

CC TYPE: amino acid 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 772 AA; 87638 MW; 3109034 CN; 

Query Match 4.8%; Score 94; DB 1; Length 772; 

Best Local Similarity 26.1%; , Pred. No. l,66e+01; , , ■ , 

Matches 23; Conservative ' 25; Mismatches 35; Indels 5; Gaps 3; 

Db 94 DFRFYFNQRGLTVRVELMALNYLMTKDPKVGREAITSIIDTLETATFKPAGDISRGIGLF 153 



II I I :i::il :|:::: |: :: :i || : :|| | | 
Qy 65^DFIGEFMKASVDV-ADLIGLNLVMSRNA:;.GKGEYKIMVAALGWAT-AELIMSRCIPLW.,120 

Db 154 MVTGAIVYDWCYDQLKPEEKTRFVKAFV 181 

: : :| ;||| I: : : :| ;| 
Qy 121 VGARGIEFDWKYIQMSIDSNISLVHYIV 148 



RESULT 3 

ID US-08-118-270-71 STANDARD; PRT; 349 AA. 
. XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 71, Application US/08118270 
XX 

CC Sequence 71, Application DS/0B118270 

CC Patent No. 5508384 

CC GENERAL INFORMATION: 

CC APPLICANT: Murphy, Randall B. 

CC APPLICANT; Schuster, David I. 

CC TITLE OF INVENTION: POLYPEPTIDES OF G-COUPLED PROTEIN 

CC TITLE OF INVENTION: RECEPTORS, AND COMPOSITIONS AND METHODS THEREOF 

_.„...CC.„.„ .NUMBER.,OF SEQUENCES,:., 348 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: BROWDY AND NEIMARK 

CC STREET: 419 Seventh Street, N.W., Suite 300 

CC CITY: Washington 

CC STATE; D.C, 

CC COUNTRY; USA 

CC ZIP; 20004 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentin Release- 11.0, Version tl. 25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/118,270 

CC FILING DATE: 09-SEP-1993 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 07/943,236 

CC FILING DATE; lO-SEP-1992 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME; Townsend, Kevin G. 

CC REGISTRATION NUMBER: 34,033 

CC REFERENCE/DOCKET NUMBER; MURPHY-2A 

CC -TELECOMMUNICATION INFORMATION; 

CC TELEPHONE; 202-628-5197 

, CC TELEFAX: 202-737-3528 

CC ■ TELEX: 248633 

-i" , -CC INFORMATION FOR SEQ ID NO: 71: i 

CC • SEQUENCE CHARACTERISTICS: 

CC '■■ LENGTH; 349 amino acids 

CC TYPE: amino acid 

CC STRANDEDKESS; single 

i CC TOPOLOGY: linear 

> CC MOLECULE TYPE; peptide 

I SQ SEQUENCE 349 AA; 39913 MW; 595896 CN; 

I ^ Query Match 4.6%; Score 89; DB 1; Length 349; 

i ■ Best Local Similarity 20,5%; Pred. No. 3.57e+01; 

' Matches 18; Conservative 29; Mismatches 38; Indels 3; Gaps 3; 

Db 94 AIAVDRHQVIMPLKPRISITKGVIYIAVIWVMTFFSLPHAICQRLFTFKYSEDIVRSLCL 153 

I : .:l -1: .: I:; : I' I : ' :: I ;|;;| : 1 I : :; ::; ; ;:; : 

j Qy 125 GIEFDWKYIQMSIDSNISLVHYIVASAQVWMIIRYDLYHNFRPAVLLLMFL-SVYKAFVM 183 

I Db, 154 DPFPEPADL-FWKYLDIATFILLYLLPL 180 

I ' : I Mil::: |{;| 

I Qy 184 ETFVHLCSLGSWARLDARA-WTGLLAL 210 
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RESOLT 4 

ID PCT-OS93-0852B-71 STANDARD; PRTj 349 AA, ...... ... 

XX 

AC xxxra 

XX 

DT 

XX 

DE Sequence 71, Application PC/TOS9308528 
XX 

CC Sequence 71, Application PC/TOS9308528 

CC GENERAL INFORMATION: 

CC APPLICANT: New York University 

CC TITLE OF INVENTION: POLYPEPTIDES OF G-COUPLED PROTEIN 

CC TITLE OF INVENTION: RECEPTORS, AND COMPOSITIONS AND METHODS THEREOF 

CC NUMBER OF SEQUENCES: 348 

CC CORRESPONDENCE ADDRESS; 

CC ADDRESSEE: BROWDY AND NEIMARK 

CC STREET: 419 Seventh Street, N.W., Suite 300 

CC CITY: Washington 

CC STATE; D.C, 

CC ' COUNTRY: USA — ' 

CC ZIP: 20004 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC Compatible 

CC ' OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentin Release fl.O, Version *1.25 

CC CURRENT APPLICATION DATA; 

CC APPLICATION NUMBER: PCT/US93/08528 

CC FILING DATE: 09-SEP-1993 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: US 07/943,236 

CC FILING DATE: lO-SEP-1992 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Tovrasend, Kevin G. 

CC REGISTRATION NUMBER: 34,033 

CC REFERENCE/DOCKET NUMBER; MURPHy=2 PCT , 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE; 202-628-5197 

CC TELEFAX: 202-737-3528 

CC TELEX: 248633 

CC INFORMATION FOR SEQ ID NO; 71: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 349 amino acids 

CC TYPE; amino acid 

CC STRANDEDNESS; single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

SO SEQUENCE 349 AA; 39913 MW; 695896 CN; 

Query Match • 4.6%; Score 89; DB 2; Length 349; 

Best Local Similarity 20. 5%; Pred. No. 3.57e+01; 

Matches 18; Conservative 29; Mismatches 38; Indels 3; Gaps 3; 

Db 94 AIAVDRHQVIMPLKPRISITKGVIYIAVIWVMTFFSLPHAICQKLFTFRYSEDIVRSLCL 153 

;| I: ; I:: : l|; :; I ;|;;| : I I : ;; ;:; : ;:; ; 

Qy 125 GIEFDWKYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFL-SVYKAFVM 183 

Db 154 DPFPEPADL-FWKYLDIATFILLYLLPL 180 

: I Mil;;: l|;| 

Qy 184 ETFVHLCSLGSWARLDARA-WTGLLAL 210 



RESULT 5 

ID US-08-948-616-3 STANDARD; PRT; 307 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 3, Application US/08948616 
XX 

CC sequence 3, Application OS/08948616 



Patent No, 5840539 
GENERAL INFORMATION :- 



CC 
CC 

CC APPLICANT: Hillman, Jennifer L. 

CC APPLICANT: Lai, Preeti 

CC APPLICANT: Shah, Purvi 

CC APPLICANT: Corley, Neil C, 

CC TITLE OF INVENTION; VESICLE TRANSPORT ASSOCIATED PROTEINS 

CC NUMBER OF SEQUENCES: 11 

CC CORRESPONDENCE ADDRESS; 

CC ADDRESSEE; Incyte Pharmaceuticals, Inc. 

CC STREET: 3174 Porter Drive 

CC CITY: Palo Alto 

CC STATE; CA 

CC COUNTRY; USA 

CC ZIP: 94304 

CC COMPUTER READABLE FORM; 

CC MEDIUM TYPE: Diskette 

CC COMPUTER: IBM Compatible 

CC OPERATING SYSTEM; DOS 

CC SOFTWARE: FastSEQ for Windows Version 2.0 

'■TC-"'~TURRENT -APPLICATION DATA: 

CC APPLICATION NUMBER; US/08/948,616 

CC FILING DATE: Herewith 

CC CLASSIFICATION: 530 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER; 

CC FILING DATE: 

CC ATTORNEY/AGENT INFORMATION; 

CC NAME; Billings, Lucy J. 

CC REGISTRATION NUMBER: 36,749 

CC REFERENCE/DOCKET NUMBER: PF-0409 US 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE; 650-855-0555 

CC TELEFAX; 650-845-4166 

CC TELEX; 

CC INFORMATION FOR SEQ ID NO: 3: 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 307 amino acids 

CC TYPE; amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC IMMEDIATE SOURCE: 

CC LIBRARY: LUNGTUT07 

CC CLONE: 2607662 

SQ SEQUENCE 307 AA; 34947 MW; 494826 CN; 

Query Match 4.5%; Score 87; DB 1; Length 307; 

Best Local Similarity 26.9%; Pred. No. 4.82e+01; 

Matches 14; conservative 14; Mismatches 22; Indels 2; Gaps 2; 

Db 240 YQCYLLVYYTG-WRNVKSFLTFGLICLCNM-YLYELRNLWQLFFHVTVGAFV 289 

, I I II : I; I : :|: :; ||;| :| : |; : :| |: 

Qy 21 YRCSGLSEYNAFWKCVQAGVTYLFVQLCKMLFLATFFPTWEGGIYDFIGEFM 72 



RESULT 6 

ID US-08-553-703A-2 STANDARD; PRT; 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 2, Application US/08553703A 
XX 

Sequence 2, Application US/08553703A 
Patent No. 5795767 
GENERAL INFORMATION: 



APPLICANT; 
APPLICANT; 
APPLICANT; 
TITLE 



MARU, ISAFUMI 
OHTA, YASUHIRO 
TSOKADA, YOJI 
INVENTION: EPIMERASE 



NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS; 



Best Available Copy 

Sun Jan 24 11:29:01 1999 US-09-162-597-3lrai 



Page 4 



CC ADDRESSEE: Knobbe, Martens, Olson 4 Bear 

CC STREET: 620 Newport Center Drive 16th Floor 

CC CITY; Newport Beach 

CC STATE: CA 

CC COUNTRY: O.S.A. 

CC ZIP; 92660 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE; Diskette 

CC COMPUTER; IBM Compatible 

CC OPERATING SYSTEM; DOS 

CC SOFTWARE; FastSEQ Version 1.5 

CC CURRENT APPLICATION DATA; 

CC APPLICATION NUMBER; US/08/553, 703A 

CC FILING DATE; 30-NOV-1995 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 

CC FILING DATE: 

CC ATTORNEY/AGENT INFORMATION; 

CC NAME: Altman, Daniel E 

CC 'REGISTRATION NUMBER; 34,115 

CC REFERENCE/DOCKET NUMBER: 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 714-760-0404 

CC TELEFAX: 714-760-9502 

CC TELEX: 

CC INFORMATION FOR SEQ ID NO: 2: 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 417 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS; single 

CC TOPOLOGY; linear 

CC MOLECULE TYPE: peptide 

CC FRAGMENT TYPE: N- terminal 

SQ SEQUENCE 417 AA; 47746 MW; 847834 CN; 

Query Match 4,5%; Score 87; DB 1; Length 417; 

Best Local Similarity 23,8%; Pred, No. 4.82e+01; 

Matches 15; Conservative 18; Mismatches 26; Indels 4; Gaps 4 

Db 19 LDRWAFWMEHSHDQEHGGFFTCLGREGRVYDDLKYVWLQGRQVWMYCRLYRTFERFRHA 78 

: I :::|; "I ;:;:;:; |: : : |||| | | : || | 

Qy 113 MSRCIPLWVG-ARGIEFDWKYIQMSIDSNI-SLVHYI-VASAQVWMITR-YDLYHNFRPA 168 

Db ' 79 QLL 81 

II 

Qy 169 VLL 171 



RESULT 7 
ID 5489533-3 
XX 

XXXXXX 



Patent No. 5489533. 

Patent No, 5489533 

APPLICANT: SPRINGER, TIMOTHY A.; STAUNTON, DONALD E,; ' 
DUSTIN, MICHAEL L. 

TITLE OF INVENTION; ISOLATED NUCLEIC ACID MOLECULES ENCODING 
ICAM-2 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA; 
APPLICATION NUMBER; US/08/384,814 
FILING DATE; 06-FEB-1995 
PRIOR APPLICATION DATA: ^ 
APPLICATION NUMBER; 89,3075 
FILING DATE; 12-JUL-1995 
APPLICATION NUMBER; 89,307 
FILING DATE: 12-jnL-1993 
APPLICATION NUMBER; 454,294 



FILING DATE: 
_ APPLICATION 
FILING DATE: 
APPLICATION 
FILING DATE: 
APPLICATION 
FILING DATE: 
APPLICATION 
FILING DATE: 
APPLICATION 
FILING DATE: 

SEQ ID NO; 3: 

LENGTH; 201 

SEQUENCE 218 AA; 



22-DEC-1989 
NUMBER; 45,963 .,. 

04-MAY-1987 
NUMBER; 115,798 

02- NOV-19B7 
NUMBER; 155,943 

16-FEB-1988 
NUMBER; 189,815 

03- MAY-1988 
NUMBER; 250,446 

28-SEP-1988 



24265 MW; 273614 CN; 



Sequence 103, Application PC/TUS9514442A 
GENERAL INFORMATION; 



Query Match 4.44; Score 85; DB 3; Length 201; 

Best Local Similarity 28.3%; Pred. No. 6.50e+01; 

Matches 17; Conservative 14; Mismatches 25; Indels 4; Gaps 4; 

Db 52 QWKHYLVSNISHDTVLQCHFTCSGKQESMNSNVSVYQPPRQVILTLQPILVAVGKSFTIE 111 

:|| I: :| : II; ;: I I ; :|: | ;| | | :| |:| :| 
Qy 129 DWK-YIQMSIDSNISL-VHYIVASAQVWMITRYDLYHNFRPAVLLLM-FL-SVYKAFVME 184 



RESULT 8 

ID PCT-US95-14442A-103 STANDARD; PRT; 

XX - ~ 

AC XXXXXX 

XX 

DT 

XX 

DE Sequence 103, Application PC/TUS9514442A 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
„ ,.CC . 
CC 
CC 
CC 
CC 
' CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT: 



Grieve, Robert B. 
Rushlow, Keith E. 
Hunter, Shirley Wu 
Frank, Glenn R. 
Heath, Andrev W. 
Yamaka, Miles Yamanaka 
Arfsten, Ann 
Dale, Beverly 
Stiegler, Gary 

USE OF PROTEASE INHIBITORS AND 
PROTEASE VACCINES TO PROTECT ANIMALS FROM FLEA 
INFESTATION, AND FLEA PROTEASE PROTEINS, NUCLEIC ACID 
MOLECULES, AND U 
.119 



TITLE OF INVENTION 
TITLE OF INVENTION; 
TITLE OF INVENTION 
TITLE OF INVENTION 
.NUMBER OF SEQUENCES: 
CORRESPONDENCE ADDRESS; 
ADDRESSEE: Sheridan Ross i Mcintosh 
STREET; 1700 Lincoln Street, Suite 3500 
CITY; Denver 
STATE; Colorado 
COUNTRY: USA " 
ZIP; 80203 
COMPUTER READABLE FORM; 
MEDIUM TYPE; Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release 11. 0, Versioi 
CURRENT APPLICATION DATA;' 
APPLICATION NUMBER: PCT/US95/14442A 
FILING DATE: 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
NAME: Gary J. Connell . 
REGISTRATION NUMBER; 32,020 
REFERENCE/DOCKET NUMBER: 
TELECOMMUNICATION INFORMATION; 
TELEPHONE: (303) 863-9700 
TELEFAX; (303 863-0223 
INFORMATION FOR SEQ ID NO: 103; 



Best Available Copy 

Sun Jan 24 11:29:01 1999 US-09-1627597-3.rai Page 5, 



CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 252 amino acids 

CC TYPE: amino acid 

CC TOPOLOGY: linear 

CC • MOLECOLE TYPE; protein 

SO SEQUENCE 252 AA; 27480 MW; 319731 CN; 

Query Match 4,44; Score 85; DB 2; Length 252; 

Best Local Similarity 31,34; Pred, No. 6,50e+01; 

Matches 15; Conservative 14; Mismatches 15; Indels 4; 

Db 150 GWGRLGAGRPIPNKLQELQTFA-LSDK-DCTVKTGLVPKSQLCVFRAS 195 

:|:|| II:: III::: :|:: III: : : :| II 
Qy 194 SWARLDA-RAWTGLLALKHFGPVCRRCQCAL-LGLVSQTLMYLFPAS 239 



RESULT 9 
ID 5489533-2 
XX 

AC xxxxxx 
XX ' 
DT 
XX 



Ol-JAN-1900 

DE Patent No. 5489533. 
XX 

CC htent NO, 5489533 

CC APPLICANT: SPRINGER, TIMOTHY A.; STAUNTON, DONALD E.; 

CC DUSTIN, MICHAEL L. 

CC TITLE OF INVENTION; ISOLATED NUCLEIC ACID MOLECULES ENCODING 

CC ICAM-2 

CC NUMBER OF SEQUENCES; 16 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/384,814 

CC FILING DATE; 06-FEB-1995 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 89,3075 

CC FILING DATE: 12-JUL-1995 

CC APPLICATION NUMBER: 89,307 

CC FILING DATE: 12-JUL-1993 

CC APPLICATION NUMBER: 454,294 

CC FILING DATE: 22-DEC-1989 

CC APPLICATION NUMBER: 45,963 

CC FILING DATE: 04-MAY-1987 

CC APPLICATION NUMBER: 115,798 

CC FILING DATE: 02-NOV-1987 

CC APPLICATION NUMBER: 155,943 

CC FILING DATE: 16-FEB-1988 

CC APPLICATION NUMBER: 189,815 

CC FILING DATE: 03-MAY-1988 

CC APPLICATION NUMBER; 250,446 

CC • FILING DATE: 28-SEP-1988 

CC SEQ ID NO; 2: 

CC LENGTH: 274 

297 AA; 33056 MW; 520742 CN; 



'Query Match 4.44; Score 85; DB 3; Length 274; 

Best Local Similarity 28,34; Pred, No. 6,50e+01; 

Matches 17; Conservative 14; Mismatches 25; Indels 4; Gaps 4 

Db 73 QWKHYLVSNISHDTVLQCHFTCSGKQESMNSNVSVYOPPRQVILTLQPTLVAVGKSFTIE 132 

:|| I; :| : I I: :: I I : :|: I :| I I ;| |:| :| 
Qy 129 DWR-YIQMSIDSNISL-VHYIVASAQVWMITRYDLYHNFRPAVLLLM-FL-SVYRAFVME 184 



ID 5512660-2 STANDARD; 
XX 

AC XXXXXX 
XX 

DT Ol-JAN-1900 
XX 

DE Patent No. 5512660, 
XX 



PRT; 297 AA. 



CC Patent No. 5512660 

CC ....APPLICANT: SPRINGER, TIMOTHY. A. ■;STAUNTON, DONALD E.; 

CC DUSTIN, MICHAEL L. 

CC TITLE OF INVENTION; PURIFIED ICAM-2 AND FRAGMENT THEREOF 

CC NUMBER OF SEQUENCES: 16 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER; US/08/194,613 



CC 


FILING DATE: 10 -FEB 


1994 


CC 


PRIOR APPLICATION DATA: 




APPLICATION NUMBER; 


89,307 


CC 


FILING DATE; 12- JUL 


1993 


CC 


APPLICATION NUMBER: 


454,294 


CC 


FILING DATE; 22-DEC 


1989 


CC 


APPLICATION NUMBER; 


45,963 


CC 


FILING DATE; 04 -MAY 


1987 


CC 


APPLICATION NUMBER; 


115,798 


CC 


FILING DATE; 02 -NOV 


1987 


CC 


APPLICATION NUMBER: 


155,943 


CC 


FILING DATE: 16-FEB 


1988 


CC 


. APPLICATION NUMBER: 


189,815 


CC 


FILING DATE: 03 -MAY 


1988 


CC 


APPLICATION NUMBER: 


250,446 


CC 


FILING DATE; 28-SEP 


1988 


CC . 


EQ ID N0:2: 




CC 


LENGTH; 274 





SQ SEQUENCE 297 AA; 33056 MW; 520742 CN; 

Query Match 4.44; Score 85; DB 3; Length 274; 

Best Local Similarity 28,34; Pred. No. 6,50e+01; 

Matches 17; Conservative 14; Mismatches 25; Indels 4; Gaps 4; 

Db 73 QWKHYLVSNISHDTVLQCHFTCSGKQESMNSNVSVYQPPRQVILTLQPTLVAVGKSFTIE 132 

:|| I: :| : I I: ;: I I : :|: I'M I :| hi :| 
Qy 129 DWK-YIQMSIDSNISL-VHYIVASAQVWMITRYDLYHNFRPAVLLLM-FL-SVYKAFVME 184 



ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 

'W CC 
CC 
CC 
CC 
CC ■ 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



US-08-118-270-45 STANDARD; 



Sequence 46, Application US/08118270 

Sequence 46, Application US/08118270 
Patent No. 5508384 
GENERAL INFORMATION; 
APPLICANT: Murphy, Randall B. 
APPLICANT; Schuster, David I, 

TITLE OF INVENTION; POLYPEPTIDES OF G-COUPLED PROTEIN 
TITLE OF INVENTION: RECEPTORS, AND COMPOSITIONS AND 
NUMBER OF SEQUENCES: 348 
CORRESPONDENCE ADDRESS; 

ADDRESSEE; BROWDY AND NEIMARK 

STREET; 419 Seventh Street, N.W., Suite 300 

CITY: Washington 

STATE; D.C, 

COUNTRY: USA 

ZIP; 20004 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
' COMPUTER; IBM PC compatible 

OPERATING SYSTEM; PC-DOS/MS-DOS 

SOFTWARE; Patentin Release tl.O, Version fl.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/118,270 

FILING DATE: 09-SEP-1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/943,236 

FILING DATE: lO-SEP-1992 
ATTORNEY/AGENT INFORMATION: 

NAME: Townsend, Kevin G. 



Best Available Copy 
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CC REGISTRATION NUMBER: 34,033 

CC REFERENCE/DOCKET NUMBER: MURPHY-2A 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE; 202-628-5197 

CC TELEFAX; 202-737-3528 

CC TELEX: 248633 

CC INFORMATION FOR SEQ ID NO; 46; 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 315 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS; Single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

316 AA; 35040 MW; 553094 CN; 



Query Match 4.3%; Score 84; ' DB 1; Length 316; 

Best Local Similarity 37. 0»; Pred. No. 7,54e+01; 

Matches 10; Conservative 10; Mismatches 5; Indels 2; Gaps 2; 

Db 43 ASADLIIA-CGLPFWAITIANNFDWLF 68 

|:|:||:: I ;|;| : : :||| : 
Qy 107 ATAELIMSRC-IPLWVGARGIEFDWKY 132 



PRT; 316 AA. 



RESULT 12 

ID PCT-US93-08528-46 STAI^ 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 46, Application PCAOS9308528 

XX 

CC sequence 46, Application PC/TUS9308528 

CC GENERAL INFORMATION: 

CC APPLICANT; New York University 

CC TITLE OF INVENTION: POLYPEPTIDES OF G-COOPLED PROTEIN 

CC TITLE OF INVENTION: RECEPTORS, AND COMPOSITIONS AND METHODS THEREOF 

CC NUMBER OF SEQUENCES; 348 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: BROWDY AND NEIMARK 

CC STREET: ' 419 Seventh Street, N.w,, Suite 300 

CC CITY; Washington 

CC STATE: D.C. 

CC ' COUNTRY: USA 

CC ZIP: 20004 

CC COMPUTER READABLE FORM; 

CC MEDIUM TYPE; Floppy' disk 

CC COMPUTER; IBM PC compatible 

CC • OPERATING SYSTEM; PC-DOS/MS-DOS 

CC SOFTWARE: Patentin Release tl.O, Version 11.25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER; PCT/US93/08S28 

CC FILING DATE; 09-SEP-1993 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: US 07/943,236 

CC FILING DATE; lO-SEP-1992 

CC ATTORNEY/AGENT INFORMATION; 

CC NAME: Townsend, Kevin G, 

CC REGISTRATION NUMBER: 34,033 

CC , REFERENCE/DOCKET NUMBER; MURPHY-2 PCT 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE; 202-628-5197 

CC TELEFAX: 202-737-3528 

CC TELEX: 248633 

CC INFORMATION FOR SEQ ID NO; 46: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 315 amino acids 

CC TYPE; amino acid 

CC STRANDEDNESS; single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

SQ SEQUENCE 316 AA; 35040 MW; 553094 CN; 



■Query Match 4.3t; Score„84; DB 2; Length 316; 

Best Local Similarity 37.0*; Pred. No, 7,54e+01; 
Matches 10; Conservative 10; Mismatches 5; Indels 2; Gaps 2; 

Db 43 ASADLIIA-CGLPFWAITIANNFDWLF 68 

|;|;||:; I i : : 
3y 107 ATAELIMSRC-IPLWVGARGIEFDWKY 132 



RESULT 13 

ID PCT-US96-10602-2 

XX 

AC xxxxxx 
XX 

DT 



XX 

DE Sequence 2, Application PC/TUS9610602 
XX 

CC Sequence 2, Application PC/TDS9610602 

CC GENERAL INFORMATION; 

CC APPLICANT; The General Hospital Corporation 

CC TITLE OF INVENTION; INHIBITION OF HEPATITIS B REPLICATION 

CC , NUMBER OF SEQUENCES: 14 

CC CORRESPONDENCE ADDRESS; 

CC ADDRESSEE; Fish S Richardson P.C. 
,. 225"Franklin -Street""" 

CC CITY; Boston 

CC STATE: MA 

CC COUNTRY; USA 

CC ZIP: 02110-2804 

CC COMPUTER READABLE FORM; 

CC MEDIUM TYPE; Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM; PC-DOS/MS -DOS 

CC SOFTWARE; Patentin Release fl.O, Version §1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: PCT/US96/10602 

CC FILING DATE; 

CC CLASSIFICATION; 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: 60/017,814 

CC FILING DATE; 20-JUN-1995 

CC CLASSIFICATION; 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME; Clark, Paul T, 

CC REGISTRATION NUMBER: 30,162 

CC REFERENCE/DOCKET NUMBER; 00786/282001 

CC .TELECOMMUNICATION INFORMATION; ... 

CC TELEPHONE; 617/542-5070 

CC TELEFAX: 617/542-8906 

CC TELEX: 200154 

CC INFORMATION FOR SEQ ID NO; 2; 

' CC SEQUENCE CHARACTERISTICS: 

CC LENGTH; 345 amino acids 

CC TYPE; amino acid 

CC TOPOLOGY; linear 

CC MOLECULE TYPE; protein 

SQ SEQUENCE 346 AA; 40128 MW; 678897 CN; 



Query Match 4.3*; Score 83; DB 2; Length 346; 

' Best Local Similarity 35.1*; Pred. No. 8,75e+01; 
^ Matches 13; Conservative 10; Mismatches 10; Indels 4; 

Db 192 NGFRWMYLRRFIIYLLVLLLC-LIFLLVLL-DWKGLI 226 

1:1 I: :: : I I :: I I I 
3y 30 NAF-WKCVQAGVTYLFVQL-CKMLFLATFFPTWEGGI 64 



ILT 14 
US-08-148-209A-2 



Best Available Copy 

^ sun Jan 24 11:29:01 1999 



US-09-162-597-3.rai 



Page 7 



XX 

DT „„. 

XX 

DE Sequence 2, Application US/08148209A 
XX 

CC Sequence 2, Application US/08148209A 

CC Patent No. 5556780 

CC GENERAL INFORMATION; 

CC APPLICANT: Dzau, Victor J 

CC APPLICANT: Mukoyama, Masashi 

CC TITLE OF INVENTION: TYPE-2 ANGIOTENSIN II RECEPTOR AND GENE 

CC NUMBER OF SEQUENCES: 15 

CC CORRESPONDENCE ADDRESS; 

CC ■ ADDRESSEE; FLEHR, HOHBACH, TEST, ALBRITTON fi HERBERT 

CC STREET; 4 Embarcadero Center, Suite 3400 

CC CITY; San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP; 94111-4187 

CC COMPUTER READABLE FORM: ^" ' " ~ 

CC ' MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS -DOS 

CC SOFTWARE: Patentin Release tl.O, Version tl.25 

CC CURRENT APPLICATION DATA; 

CC ■ APPLICATION NUMBER: OS/08/148, 209A 

CC FILING DATE: 05-NOV-1993 

CC CLASSIFICATION; 800 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Rowland, Bertram I 

CC REGISTRATION NUMBER; 20,015 

CC REFERENCE/DOCKET NUMBER: A-58491-1/BIR 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE; (415) 781-1989 

CC TELEFAX; (415) 398*3249 

CC TELEX: 910 277299 

CC INFORMATION FOR SEQ ID NO: 2; 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 363 amino acids 

CC TYPE; amino acid 

CC TOPOLOGY: linear 

CC MOLECULE TYPE; protein 

SQ SEQUENCE 363 AA; 41373 MW; 722740 CN; 

Query Match 4.3%; Score 83; DB 1; Length 363; 

Best Local Similarity 32.6%; Pred. No, 8.75e+01; 

Matches 14; Conservative 11; Mismatches 14; Indels 4; Gaps 4; 

Db 172 CLSSLPTFY-FRD-VRT-IEYLGVNACIMAFPPEKYAQWSAGI 211 

I hi: : I: |;; ; II I III: :: I ill 

Qy 23 C-SGLSEYNAFWKCVQAGVTYLFVQLCKMLFLATFFPTWEGGI 64 



CC STATE: California 

CC — GOUNTRT: USA — — 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER; IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE; Patentin Release fl.O, Version 11.25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: OS/08/148, 209A 

CC FILING DATE: 05-NOV-:993 

CC CLASSIFICATION; 800 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Rowland, Bertram I 

CC REGISTRATION NUMBER; 20,015 

CC REFERENCE/DOCKET NUMBER; A-58491-1/BIR 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE: (415) 781-1989 

CC TELEFAX: (415) 398*3249 

CC TELEX; 910 277299 

CC INFORMATION FOR SEO ID NO: 3: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 363 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS; Single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

SQ SEQUENCE 363 AA; 41330 MW; 725910 CN; 

Query Match 4.3%; Score 83; DB 1; Length 363; 

Best Local Similarity 32.6%; Pred. No. 8.75e+01; 

Matches 14; Conservative 11; Mismatches 14; Indels 4; 

Db 172 CLSSLPTFY-FRD-VRT-IEYLGVNACIMAFPPEKYAQWSAGI 211 

I I;|; : I; h; ; II I IN: :: I :|| 

Qy 23 C-SGLSEYNAFWKCVQAGVTYLFVOLCKMLFLATFFPTWEGGI 64 



Search completed; Fri Jan 22 18:03:21 1999 
Job time : 14 sees. 



RESULT 15 

ID OS-08-148-209A-3 STANDARD; PRT; 353 AA. ' i) , 

XX 

AC xxxxxx 

XX 

DT 

XX 

DE' Sequence 3, Application OS/08148209A 
XX 

CC Sequence 3, Application US/08148209A 

CC Patent No. 5556780 

CC GENERAL INFORMATION; 

CC APPLICANT: Dzau, Victor J 

CC APPLICANT: Mukoyama, Masashi 

CC TITLE OF INVENTION; TYPE-2 ANGIOTENSIN II RECEPTOR AND GENE 

CC NUMBER OF SEQUENCES; 15 

CC CORRESPONDENCE ADDRESS; 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON i HERBERT 

CC STREET; 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 



Best Available Copy 

,&un Jan 24 11:29:00 1999 US-09-162-597-3.rag 



>****♦****♦**********( 



1 \ / / 1 1 _ 1 1 _l 

l\V/l 1 1 1 1 1 1 1 
1 VV 1 1 1 l_l 1 1 l_ 
1 Ml 1 l_ 1 


1 _ 1 1 _l 
M M M 
M_ll M 
1 _l M 


1 1 1 
1 1 1 
1 !_! 


1 MM M 
1 MM _M 
1 Mil II 


M\\ M 
WW M_ 

M \\ 1 1 


1 l~l 

1 1 1 
1 1 1 



(IM) 



Release 3.1A John F, Collins, Biocomputing Research Unit. 
Copyright (c) 1993-1998 University of Edinburgh, O.K. 
Distribution rights by Oxford Molecular Ltd 

MPsrch_pp protein ■ protein database search, using Smith-Waterman algorithm 

Run on: Frt Jan 22 17:'58:2r 1199;"" Mas Paftiiel'.'orsrcMds 

492,876 Million cell updates/sec 

Tabular output not generated. 



Title: 

Description: 
Perfect Score: 



Scoring table; 
Searched: 



>DS-09-162-597-3 

(1-245) from [JS09162597.pep 

1940 

1 MTLFHFGNCFALAYFPYFIT GLVSOTLMYLFPASLQVLVR 245 

PAM 150 
Gap 11 

131922 seqs, 16180660 residues 



Post-processing: Minimum Match 0% 

Listing first 45 summaries 

Database: a-geneseq32 

1: parti 2:part2 3;part3 4:part4 5;part5 6;part6 7: part? 
8;part8 9:part9 10:partlO ll:partll 12:partl2 13:partl3 
14:partl4 15;partl5 16;partl6 17:partl7 18:partl8 
19;partl9 20;part20 21:part21 22:part22 23:part23 
24:part24 25;part25 25:part25 27:part27 28:part28 
29:part29 

Statistics: Mean 33.119; Variance 147.508; scale 0.225 

Pred. No, is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Query 



No. 


Score 


Match Length 


DB 


ID 


Description 


Pred. No, 


PI 

' ■• DR 
DR 


1 


99 


5.1 


237 


22 


W20142 


H, pylori inner membr 


l,14e+01 


2 


98 


5.1 


422 


21 


R97722 


Mouse inositol polyph 


l,34e+01 


PT 


3 


98 


5.1 


497 


22 


W20821 


H, pylori cell envelo 


1.34e+01 


PT 


4 


95 


4.9 


1353 


19 


R99251 


Murine adenylate cycl 


2,17e+01 


PT 


5 


94 


4.8 


772 


15 


R89264 


Heparinase-ll. 


2.55e+01 


PS 


. 6 


94 


4.8 


843 


13 


R67760 


Lys-aminopeptidase Pe 


2,55e+01 


CC 


7 


91 


4.7 


359 


22 


W20927 


H. pylori surface or 


4.11e+01 


CC 


8 


92 


4.7 


521 


21 


W14445 


CarA gene product. 


3.51e+01 


CC 


9 

10 


89 


4,6 


349 


19 


W02722 


G-protein coupled mou 


5,54e+01 


CC 


89 


4,6 


349 


16 


R48750 


G-protein coupled mou 


5.54e+01 


■CC 


11 


87 


4,5 


174 


3 


R21412 


NADH dehydrogenase 6. 


7.70e+01 


CC 


12 


87 


4,5 


293 


21 


W14481 


Ranp-1, 


7,70e+01 


• CC 


13 


87 


4,5 


417 


16 


R79929 


Porcine acylglucosami 


7.70e+01 


CC 


14 


88 


4.5 


558 


2 


P70429 


Murine neuroleukin. 


6.59e+01 


CC 


15 


85 


4.4 


152 


29 


W48271 


Rat ninjurin 1, 


,1.05e+02 


CC 


16 


85 


4,4 


252 


20 


W01210 


Serine protease PfSPl 


1.05e+02 


CC 


17 


85 


4,4 


274 


1 


R06836 


Intercellular adhesio 


1.05e+02 


CC 


18 


85 


4.4 


1477 


22 


W10424 


Saccharomyces cerevis 


1.05e+02 


,cc 



19 85 




R57691 


S. cerevisiae scaur2R 


1 (Tie+ni 


20 86 -■ 


4;4 1989 18" 




•"•'Peripheral' 'nervous 'sy*" 


■■8",'99e+0i 


21 86 


4,4 4473 20 


R97244 


virulence gene cluste 


8,99e+01 


22 84 


4.3 236 1 


P81992 


Hepatitis B viral sur 




23 84 


4,3 316 19 


nU^oS / 


G -protein coupled rat 


1 09e+n5 








G-protein coupled rat 




25 g3 


4,3 346 21 




WHY core-surface fusi 


1.42e'i"02 


26 83 


4,3 363 13 


R66934 


Mouse AT2 receptor. 


1.42e+02 


27 83 


4.3 380 11 


R53750 


OCVCli LLallSlllCllUJiallC L 




28 83 


4.3 391 29 


W38453 


niiU Lypc ilLUUdU ilUlUUi 




29 84 


4.3 422 22 




H. pylori transmembra 


1 99o+n9 


30 84 




W20934 


H. pylori surface or 


l,22e't"02 


31 84 


4 3 494 28 


W42996 


Putative mature potas 


1 . 22e'''02 


32 84 


4.3 494 16 


R90765 


Human K^* channel 2 ma 


1.22e't'02 


33 84 


,1 1 1101 10 

4. J liii 


W34536 


Nudaurelia beta virus 




34 84 


4.3 1233 29 


W41935 


Nudaurelia beta-like 


i.^^e+uj 


35 84 


4. J ZjoO ^0 




Yeast checkpoint cont 








R29527 


Hcv antigen i/wi ju. 


1.22e+02 


37 84 


4.3 3011 4 


R22154 


NANBV Hutch c59 isola 


1.22e-t-02 




'4!3-'3085'2V 


Wl970r" 


"ATM mutant G9170C. 


l!42e+02 


39 82 


4,2 226 7 


R33252 


HBsAg encoded by pGPD 


1.66e+02 


40 82 


4,2 250 2 


R11496 


RPl42/HBsAg. 


1.66e+02 


41 82 


4.2 351 21 


W09047 


Plasniid pHBV DN inser 


l,66e+02 


42 82 


4.2 558 2 


P70430 


Human neuroleukin. 


l,66e+02 


43 82 


4.2 658 27 


W27666 


Streptococcus pneumon 


l,66e+02 


44 82 


4,2 724 27 


W36797 


Novel human gene, des 


l,66e+02 


45 82 


4,2 898 21 


W14777 


Granulosis virus infe 


l,66e+02 



ALIGNMENTS 

RESULT 1 

ID W20142 standard; protein; 237 AA, ■ 
AC W20142; 

DT 08-JOL-1997 (first entry) 
DE H. pylori inner membrane protein, 144 554 61. aa. 
KW Cytoplasmic; vaccine; prevention; treatment; infection; identification; 
KW binding compound; bacterium; life cycle; activator; bacteria; inhibitor; 
KW duodenal ulcer disease; chronic gastritis; diagnosis; envelope. 
OS Helicobacter pylori. 

Key Location/Qualifiers 
miscjifference 3 

"encoded by YCC" 



/note- 
miscjifference 130 

/note- "encoded by RCG" 
miscjifference 183 

/note- "encoded by GSY" 

WO9640893-A1. 
19-DEC-1995. 

06- JON-1996; 009122. 

07- JON-1995; US-487032. 
Ol-APR-1996; OS-630405, 
(ASTR ) ASTRA AB, 

Berglindh OT, Smith D, Mellgaerd BL; 

WPI; 97-052306/05. t 
N-PSDB; T67385, 

Helicobacter pylori nucleic acid sequences and related 
polypeptide(s) - useful for vaccines to treat or prevent H. pylori 
infection, and to detect Helicobacter 
Claim 56; Pages 365; 1481pp; English. 

The present sequence is a Helicobacter pylori inner membrane protein. 
The protein may be used in a vaccine to prevent or treat H. pylori 
infection or to identify H. pylori polypeptide binding compounds, 
,useful as potential H. pylori life cycle activators or inhibitors. 
The genomic sequence of H. pylori (ATCC 55679) was determined from 
overlapping contigs generated by mechanically shearing the bacterial 
DNA. The sequences were analysed for ORF of at least 180 nucleotides, 
and the predicted coding regions defined by computer evaluation, To 
identify likely H. pylori antigens for vaccine development, the aminO; 
acid sequences predicted from various ORF were analysed for significant 
homology to other known or exported membrane proteins, Having identified 
and determined the sequences of interest, particular regions can be 
isolated from H. pylori by PCR amplification for recombinant polypeptide 
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CC production, e.g. in E. coli hosts. 
SQ Sequence 237 AA; 

Query Match 5.1%; Score 99; DB 22; Length 237; 

Best Local Similarity 25.2%; Pred. No, l,14e+01; 

Matches 32; Conservative 30; Mismatches 54; Indels 11; Gaps 9 

Db 23 vglcsylligfwyhk-ksannasieafvmnritdlgmlnigiilif-wnfgtlqykevfsm 80 

I II I::: : I |:: ; |:; |:|: |; :| j ;|| : ; 

Qy 45 VOLCKMLFLATFFPTWEGGIYDFIGEFMKASV-DVADLIGLNLVMSRNAGKGEYRIMVAA 103 

Db 81 Innadysmlfyisvflfigamgksaqfpinhtwlananiegptpvsalihattmvtagvyli 140 

II:: : I :|l I :| I I : : | | : | : : | | : | , 
Qy 104 LGWATAELIMSRCIPLWVGARG"IEF--D-WKyiQMSIDSNIS-LVH-YIVASAQVWMI 156 

Db 141 iranply 147 

I : II 
Qy 157 TRYD-LY 152 



RESOLT 2 

ID R97722 standard; Protein; 422 AA. 

AC R97722; 

DT 29-APR-1997 (first entry) 

DE Mouse inositol polyphosphate binding protein IP4-BP. 

KW Synaptogainin; antagonist; inhibitor; neurotransmitter; hormone; 

KW calcium release; inositol; polyphosphate; pentakisphosphate; 

KW hexakisphosphate; tetrakisphosphate; binding protein; IP4-BP; 

KW PCR primer; polymerase chain reaction; murine. 

OS Mus musculus. 

FH Key Location/Qualifiers 

FT peptide 315.. 34 6 

FT /label- synaptogamin_ll 

FT /note- "part of C2A domain" 

PN J08092290-A. 

PD 09-APR-1996. 

PF 20-SEP-1994; 252942. 

PR 20-SEP-1994; JP-252942. 

PA (SOSE-) SOSEI KK, 

DR WPI; 95-236100/24. 

DR N-PSDB; T29743. 

PT Inositol poly: phosphate (IPP)-binding peptide • useful as an 

PT inhibitor of calcium, neuro; transmitter release or hormone release 

PT ,and for identifying IPP antagonists 

PS Example 8; Page 14-15; 19pp; Japanese. 

CC Degenerate primers (see T29741 and T29742) were designed based on 

CC peptide fragments from the inositol polyphosphate binding protein 

CC IP4BP isolated from ddY mouse cerebellum. The primers were used for 

CC ^PCR amplification of the mouse IP4BP gene (see T29743). The C2A 

CC domain at the C-terminal end of IP4BP contains the synaptogamin II 

CC peptide. A consensus sequence was obtained from a comparison of ■ 

CC synaptogamin peptides from a variety of species . Peptides comprising 

CC the consensus motif (see R97721) are able to bind to inositol 

CC tetrakisphosphate, inositol pentakisphosphate and inositol 

CC hexakisphosphate; such peptides can inhibit neurotransmitter release ; 

CC and hormone release and are calcium inhibitors. 

SQ Sequence 422 AA; 



Query Match 5.1%; 
Best Local Similarity 48.5%; 
Matches 16; Conservative 



Score 98; DB 21; Length 422; 
Pred. No. 1.34e+01; 
6; Mismatches 8; Indels 3; 



Db 62 plppwa-liamavvaglllltccf-cickkccc 92 

:| :ll I I 111:111 I I :|::| I 
Qy 191 SLGSWARLDARAWTGLLALKH-FGPVCRRCQC 222 



ID W20821 standard; Protein; 497 AA. 

AC W20821; 

DT 16-JUL-1997 (first entry) 

DE H. pylori cell envelope inner membrane protein Ilcell603orfl6, 

KW Cytoplasmic; vaccine; prevention; treatment; infection; identification; 



KW binding compound; bacterium; life cycle; activator; bacteria; inhibitor; 

KW duodenal ulcer disease; chronic gastritis; diagnosis; envelope. .. 

OS Helicobacter pylori. 

PN WO9540893-A1. 

PD 19-DEC-1996. 

PF 06-JUN-1996; D09122. 

PR 07-JON-1995; aS-487032. 

PR Ol-APR-1996; US-630405. ■ 

PA (ASTR ) ASTRA AB. 

PI Berglindh OT, Smith D, Mellgaerd BL; 

DR WPI; 97-052306/05. 

DR N-PSDB; T68074. 

PT Helicobacter pylori nucleic acid sequences and related 

PT polypeptide(s) - useful for vaccines to treat or prevent H. pylori 

PT infection, and to detect Helicobacter 

PS Claim 56; Page 1226-1227; 1481pp; English. 

CC The present sequence shows a Helicobacter pylori cell envelope 

CC inner membrane protein. 

CC The protein may be used in a vaccine to prevent or treat H. pylori 

CC infection or to identify H. pylori polypeptide binding compounds, 

CC useful as potential H. pylori life cycle activators or inhibitors. 

CC The genomic sequence of H. pylori (ATCC 55679) was determined from 

CC overlapping contigs generated by mechanically shearing the bacterial 

CC DNA, The sequences were analysed for ORF of at least 180 nucleotides, 

CC and the predicted coding regions defined by computer evaluation. To 

CC identify likely H, pylori antigens for vaccine development, the amino 

CC acid sequences predicted from various ORF were analysed for significant 

CC homology to other known or exported membrane proteins. Having identified 

CC and determined the sequences of interest, particular regions can be 
■"CC~"isolaTed'"from H. pylori by PCR amplification for recombinant polypeptide 

CC production, e.g. in E. coli hosts, 
497 AA; 



Query Match 5,1%; 
Best Local Similarity 25.2%; 
Matches 32; Conservative 



Score 90; DB 22; Length 497; 

Pred. No. 1.34e+01; 

30; Mismatches 54; indels 11; 



Db 155 vglcsylligfwyhk-ksannasieafvmnritdlgmlmgiilif-wnfgtlqykevfsm 213 

I II I::: : :: I h: : |:: |:|: |; :| I :|| : : 
Qy 45 VQLCKMLFLATFFPTWEGGIYDFIGEFMKASV-DVADLIGLNLVMSRNAGKGEYKIMVAA 103 

Db 214 Innadysmlfyisvflfigamgksaqfpmhtwlanamegptpvsalihaatmvtagvyli 273 

II:: : I :|ll :| I I : :||:| : ;M :| 
Qy 104 LGWATAELIMSRCIPLWVGARG--IEF--D-WKYIQMSIDSNIS-LVHY-IVASAQVWMI 156 

Db 274 iranply 280 

I : II 

Qy 157 TRYD-LY 162 



ID ■ R99251 Standard; Protein; 1353 AA. 

AC R99251; 

■ DT Ol-DEC-1996 (first entry) 

'.DE ' Murine adenylate cyclase 9. 

KW Adenylate cyclase; calcineurin; cellular metabolism; tumour; 

KW neurological; cardiovascular; endocrine; respiratory; bone; kidney; 

KW pregnancy; gut; therapy. 

OS Mus cookii. 

FH Key Location/Qualifiers 

FT modified.site 27 

FT '' /label- C -kinase 

FT /note= "putative protein kinase C phosphorylation 

FT site" 

FT modified_site 84 

FT /label- CAM.kinase-11 

FT /note= "putative CAM kinase II phosphorylation 

FT site" 

FT modified_site 85 

FT /label- A-kinase 

FT.. , /note- "putative protein kinase A phosphorylation 

FT site" 

FT modified_site 94 
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FT /label- C-kinase 

FT /note= "putative protein kinase C phosphorylation 

FT site" 

FT niodified_site 187 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT modifiedjite 200 

FT /label- CamJinase-II 

FT /note- "putative CAM kinase II phosphorylation 

FT site" 

FT modified_site 201 

FT /label- A-kinase 

FT /note- "putative protein kinase A phosphorylation 

FT site" 

FT niodified_site 261 

FT /label- Casein_kinase-II 

FT /note- "putative casein kinase II phosphorylation 

FT site" 

FT tnodlfied_site 266 " 

FT • /label- C-kinase 

FT ■ /note- "putative protein kinase C phosphorylation 

FT site" 

FT niodified_site 303 

FT /label- CAMjinase-Ii 

FT • /note- "putative CAM kinase II phosphorylation 

FT site" 

FT niodified_site 304 

FT /label- A-kinase 

FT /note- "putative protein kinase A phosphorylation 

FT site" 

FT modifiedjite 356 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT niodified_site 358 

FT /label- CAM_kinase-II 

FT /note- "putative CAM kinase II phosphorylation 

FT site" 

FT niodified_site 370 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT niodified_site 432 

FT /label" Casein_kinase-ll 

FT /note- "putative casein kinase II phosphorylation 

FT site" 

FT niodified_site 436 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT domain 503.. 610 

FT /label- Calcineurinjoindingjomain 

FT niodified_site 581 

FT /label- Casein_kinase-II 

F! /note- "putative casein kinase II phosphorylation 

FT site" 

FT modified_site 588 

FT /label- Casein_kinase-ll 

FT /note- "putative casein kinase II phosphorylation 

FT- site" 

FT modified_site 596 

FT /label- CAM_kinase-ll 

FT /note- "putative CAM kinase II phosphorylation 

FT site" 

FT inodified_site 597 

FT /label- A-kinase 

FT /note- "putative protein kinase A phosphorylation 

FT Site" 

FT modified_site 519 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 



j FT inodified_site 690 

FT -..<- /label- Caseinji-nase-ll ■■■■■ 

FT /note- "putative casein kinase II phosphorylation 

FT site" 

FT inodified.site 719 

FT /label- CAMJinase-II 

I FT /note- "putative CAM kinase II phosphorylation 

FT site" 

FT inodified.site 732 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT iiiodified_site 751 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT inodified_site 763 

FT /label- A-kinase 

/note- _"putative^ protein .kinase A phosphorylation 

""ft " site" 

FT modified_site 765 

ft /label- Caseinjinase-ll 

FT /note- "putative casein kinase II phosphorylation 

FT site" 

FT inodified_site 837 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT inodified.site 885 

FT /label- Casein.kinase-n 

. FT /note- "putative casein kinase II phosphorylation 

FT site" 

FT niodified_site 919 

FT /label- A-kinase 

FT /note- "putative protein kinase A phosphorylation 

FT site" 

FT niodified_site 971 

FT /label- CAMJinase-ll 

FT /note- "putative CAM kinase II phosphorylation 

FT site" 

!■ FT niodified_site 972 

FT /label- A-kinase 

FT /note- "putative protein kinase A phosphorylation 

FT site" 

FT niodified_site 972 

FT /label- C-kinase 

FT' /note- "putative protein kinase C phosphorylation 

FT site" 

FT inodified.site 1002 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT inodified.site 1019 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation j' 

FT site" 

FT modified.site 1002 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT inodified_site 1105 

FT /label- C-kinase 

■ FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT modified_site 1194 

FT /label- CAMjinase-II 

FT /note- "putative CAM kinase II phosphorylation 

FT site" 

FT niodified_site 1199 

FT /label- CaseinJinase-II 

FT /note- "putative casein kinase ii phosphorylation 

FT site" 

FT modifiedjite 1211 
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FT /label" c-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 

FT niodified_site 1212 

FT /label= CAMJinase-ll 

FT /note- "putative CAM kinase 11 phosphorylation 

FT site" 

FT modified_site 1213 

FT /label= A-kinase 

FT /note- "putative protein kinase A phosphorylation 

FT site" 

FT inodified_site 1270 

FT /label- CAMjinase-ll 

FI /note- "putative CAM kinase II phosphorylation 

FT site" 

FT modified.site 1310 

FT /label- Casein_kinase-ll 

FT /note- "putative casein kinase II phosphorylation 

FT site" 

FT modified_site 1310 

FT /label- C-kinase 

FT /note- "putative protein kinase C phosphorylation 

FT site" 



m WO9625502-A1. 

PD 22-AUG-1996. 

PF 14-FEB-1996; G00312. 

PR 14-FEB-1995; GB-002806. 

PR ll-AOG-1995; GB-016528, 

PA (MEDI-) MEDICAL RES CODNCIL, 

PI Antoni F, Paterson JM; 

DR WPI; 96-393403/39. 

DR N-PSDB; T35209. 

Note: remainder of annotations omitted, 

Query Match 4.9*; Score 95; DB 19; Length 1353; 

Best Local Similarity 26.8%; Pred. No. 2.17e+01; 

Matches 15; Conservative 16; Mismatches 23; Indels 2; Gaps 2; 

Db 887 efetnihvtmftgsavlvavvhycnfcqlsswnirsslativgaglllllhis-lcq 941 

;l : : II : : : "I hll I ::| :||| I |:: :|: 
Qy 164 NFRPAVLLLMFLSVYKAFVMETFVHLCSLGSWARLDARAW-TGLLALKHFGPVCR 218 



RESULT 5 

ID R89264 standard; Protein; 772 AA. 

AC R89264; 

DT 07-APR-1996 (first entry) 

DE ,Heparinase-II. 

KW Heparinase-II; heparin degradation; heparan sulphate degradation; 

KW Escherichia coli; polymerase chain reaction; PCR; primer; toxicity; 

KW cloning; vector; ribosome binding site; haemostatic; 

KW blood-clotting; antibody; affinity chromatography, 

OS Flavobacterium heparinura. 



FH Key Location/Qualifiers 

FT peptide 1,.25 
FT /note- "Signal peptide" 

FT peptide 85.. 96 

FT /note- "Peptide 2B (R89267)" 

FT peptide 222,, 233 

FT /note- "Peptide 2A (R89255)" 

FT peptide 500,, 526 
FT /note- "Peptide 2C (R89268)" 



PN W09534635-A1, 

PD 21-DEC-1995. 

PF 09-JON-1995; a07391. 

PR lO-JUN-1994; OS-258639, 

PA (IBEX-) IBEX TECHNOLOGIES, 

PA (ZIMM/) ZIMMERMANN J. 

PI Bennett C, Slain F, Gu K, Musil R, Su H, Zimmermann J; 

DR WPI; 96-097381/10. 

I3R N-PSDB; 099226. 

PT Nucleic acids encoding Flavobacterium heparinum heparinase II and 



PT III - for degrading heparin and heparan sulphate, also related host 

PT. cells., proteins and antibodies, useful in heparinase purificn...™-,..... 

PS Claim 14; Fig 5; 75pp; English. 

CC The sequence corresponds to Flavobacterium heparinum heparinase-ll, 

CC which degrades heparin and heparan sulphate. The sequence is 

CC isolated from a gene library in phage lambda -DASH- I I in Escherichia 

CC coli by polymerase chain reaction amplification using e,g, primers 

CC 2-1 to 2-4 (Q99228-Q99231), which are designed based on the 

CC sequences of peptides 2A and 2B. Toxicity of the gene and natural 

CC selection of the host against clones with the entire sequence has 

CC been circumvented by cloning sections of the gene separately in a 

CC vector with a modified ribosome binding site, which increases 

CC expression levels. The heparinase-II may be used to neutralise 

CC anticoagulant activity. Antibodies against the protein may be used 

CC to differentiate between native and recombinant enzpes, and when 

CC immobilised they may be used for heparinase purification by 

CC affinity chromatography. 

SQ Sequence 772 AA; 

Query Match 4.8%; Score 94; DB 15; Length 772; 

Best Local Similarity 26.1%; Pred, No, 2,55e+01; 

Matches 23; Conservative 25; Mismatches 35; Indels 5; Gaps 3; 

Db 94 dfrfyfnqkgltvrvelmalnylratkdpkvgreaitsiidtletatfkpagdisrgiglf 153 

^..1 -.1 :; |. :|::|.| ■ .:: :| II : :|| I I 

Qy 66 DFIGEFMKASVDV-ADLIGLNLVMSRNA-GKGEYKIMVAALGWAT--AELIMSRCIPLW 120 

Db 154 mvtgaivydwcydqlkpeektrfvkafv 181 
: : :| ;!! I I: : : :| :| ■ 
Qy 121 VGARGIEFDWKnQMSIDSNISLVHYIV 148 



RESULT 6 

ID R67760 standard; Protein; 843 AA. 

AC R67760; 

DT Ol-AUG-1995 (first entry) 

DE Lys-aminopeptidase PepN. 

KW Lys-aminopeptidase; PepN; fermented food; cheese. 

OS Lactobacillus delbrueckii subsp, lactis. 

PN EP-633316-A, 

PD ll-JAN-1995. 

PF 30-JDN-1994; 401497. 

PR 01-J[JL-1993; GB-013586. 

PA (EECE-) EEC EURO . ECONOMIC COMMUNITY. 

PI Klein JR, Plapp R; 

DR WPI; 95-038513/06, 

DR N-PSDB; Q79913, 

PT Purified Lys-aminopeptidase PepN enzyme and PepN gene ■ useful 

PI for prepn, of fermented foodstuff, esp. cheese, 

PS Claim 3; Page 17-21; 41pp; English, 

CC A new Lys-aminopeptidase, PepN, was isolated from L, delbrueckii 

CC subsp. lactis WS87 (DSM 7290) and had the sequence given in ; • , 

CC R67760, The pepN gene (Q79913) was isolated from a library 

CC of DSM 7290 chromosomal • DNA by screening for peptolytic activity 

CC in Escherichia coli ER1562 trans formants. The isolated gene 

CC is used for recombinant PepN production , 

SQ Sequence 843 AA; 

Query Match 4,8%; Score 94; DB 13; Length 843; 

Best Local Similarity 21,1%; Pred. No, 2,55e+01; 

Matches 8; Conservative 15; Mismatches 14; Indels 1; Gaps 1; 

Db 751 ewawlektvggdraefatfitvisrvfktkerydeynaf 798 

:| ;:; : :| ;:| : ||| |: | 

Qy' 129 DWKYIQMSIDSHISLVHYIVASAQVWMIT-RYDLYHNF 165 



I'' RESULT 7 

ID W20927 standard; protein; 359 AA, 
I AC .W20927; 

I DT 21-JUL-1997 (first entry) 

; DE H, pylori surface or membrane protein, 16ael0508orf3. 

KW Cytoplasmic; vaccine; prevention; treatment; infection; identification; 
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KW binding compound; bacterium; life cycle; activator; bacteria; inhibitor; 

KW duodenal ulcer disease; chronic gastritis; diagnosis; envelope. .. 

OS Helicobacter pylori, 

PN WO9640893-A1, 

PD 19-DEC-1996. 

PF 06-JaN-1996; D09122. 

PR 07-JUN-1995; DS-487032. 

PR Ol-APR-1996; DS-630405. 

PA (ASIR ) ASTRA AB. 

PI Berglindh OT, Smith D, Mellgaerd BL; 

DR WPI; 97-052306/05. 

DR N-PSDB; W20927. 

PT Helicobacter pylori nucleic acid sequences and related 

PT polypeptide (s) • useful for vaccines to treat or prevent H. pylori 

PT infection, and to detect Helicobacter 

PS Claim 73; Page 132M325; 1481pp; English. 

CC The present sequence is a H. pylori surface or membrane protein likely 

CC to contain seven or more membrane spanning regions, 

CC The protein may be used in a vaccine to prevent or treat H, pylori 

CC infection or to identify H. pylori polypeptide binding compounds, 

CC useful as potential H. pylori* life- cycle activators -or -inhibitors . 

CC The genomic sequence of H. pylori (ATCC 55679) was determined from 

CC overlapping contigs generated by mechanically shearing the bacterial 

CC DNA, The sequences were analysed for ORF of at least 180 nucleotides, 

CC and the predicted coding regions defined by computer evaluation. To 

CC identify likely H. pylori antigens for vaccine development, the amino 

CC acid sequences predicted from various ORF were analysed for significant 

CC homology to other known or exported membrane proteins . Having identified 

CC and determined the sequences of interest, particular regions can be 

CC isolated from H, pylori by PCR amplification for recombinant polypeptide 

CC production, e,g. in E. coll hosts, 

SQ Sequence 359 AA; 

Query Match 4.7%; Score 91; DB 22; Length 359; 

Best local Similarity 34.3%; Pred. No. 4,lle+01; 

Matches 12; Conservative 13; Mismatches 7; Indels 3; Gaps 3; 

Db 242 slvp-avggaliwipiaiyelyhghvneaifivly 275 

III |::| :|; |: |:||| : |:;:::: 
Qy 142 SLVHYIVASAOVWM-ITRYDLYH-NFRPAVLLLMF 174 



RESULT 8 

ID W14445 standard; Protein; 521 AA, 

AC W14445; 

DT 17-MAY-1997 (first entry) 

DE CarA gene product. 

KW carbapeneni; carR; carA; carB; carC; carD; carE; carF; carG; carH; 

KW biosynthesis; antibiotic; beta- lactamase inhibitor; regulatory, 

OS Erwinia carotovora. 

PN W09532294-A1. 

PD 30-NOV-1995, 

PF 18-MAY-1995; G01125, 

PR 20-MAY-1994; GB-010142. 

PA (DYNO-) ONIV NOTTINGHAM, PA (DYWA-) UNIV WARWICK, 

PI. Bycroft BW, Cox ARJ, Holden MTG, Mcgowan SJ, Porter LE; 

PI Salmond GPC, Sebaihia M, Stewart GSA, Williams P; 

DR WPI; 96-020587/02. 

DR N-PSDB; T09700. 

PT New genes encoding enzymes involved in carbapenem biosynthesis • 

PT , useful for isolating other carbapenem synthesising genes and 

PT producing carbapenem in heterologous organisms 

PS Disclosure; Fig 4; 31pp; English. 

CC W14445-52 are the protein products of the car genes (carA-H) which are 

CC involved in the biosynthesis of carbapenem, an antibiotic which acts as 

CC a beta -lactamase inhibitor. The car biosynthetic genes are regulated by 

CC the carR gene product (W14444), a positive activator of the biosynthetic 

CC genes which functions in trans, The car genes can be used to identify 

CC homologous genes in bacteria and fungi, while products of the genes may 

CC be used to raise antibodies used to detect clones (in a Streptomyces 

CC library) that express cross -reactive protein, 

SQ Sequence 521 AA; 



Query Match 4.7%; Score 92; DB 21; Length 521; 

. .... .Best-Loeal-Similar-ity 36.-4%';- Pred.--NO;-"3,51e+01; • - 

Matches 12; Conservative 6; Mismatches 14; indels 1; Gaps 1; 

Db 64 kcemergtayligslynrtfliglagvwegeay 96 

II :: I :ll: I : II : III I 
Qy 34 KC-VQAGVTYLFVQLCKMLFLATFFPTWEGGIY 65 



RESULT 9 

ID W02722 Standard; peptide; 349 AA, 

AC W02722; 

DT 13-NOV-1996 (first entry) 

DE G-protein coupled mouse glucocorticoid-induced receptor. 

KW G-protein coupled receptor; ligand binding assay; transmembrane domain; 

KW schizophrenia; dopamine; cAMP; adenosine; thrombin; adrenergic; opsin; 

KW muscarinic acetylcholine; endothelin; bombesin; endocrine; rhodopsin; 

KW odorant; cytomegalovirus; serotonergic, 

OS Mus musculus, 

PN OS5508384-A. 

PD 15-APR-1996. 

PF lO-SEP-1992; 943236, 

PR lO-SEP-1992; US-943235. 

PR 09-SEP-1993; US-118270, 

PA (OYNY ) UNIV NEW YORK STATE, 

PI Murphy RB, Schuster Dl; 

DR WPI; 96-208785/21, 

PT New dopamine receptor peptide - useful as antipsychotic agent, e.g. 

PT for treating schizophrenia 

PS Disclosure; Column 163-166; 184pp; English, 

CC Proteins W02657-W02730 represent a range of G-protein coupled receptor 

CC (GPR) proteins selected from cAMP, adenosine, muscarinic acetylcholine, 

CC adrenergic, thrombin, endothelin, bombesin, endocrine, rhodopsin, opsin,' 

CC odorant, cytomegaloviral and other GPR proteins. The receptor proteins 

CC were used to design polypeptides, pref . based on the transmembrane 

CC domains, for use in G-protein coupled receptor ligand binding assays. 

CC The polypeptide fragments retain biological activity such as binding a 

CC GPR ligand or modulating GPR ligand binding to a GPR (see W02747-W02999 ' 

CC for examples of polypeptide fragments), The polypeptide fragments can 

CC be used in compositions for treating subjects suffering from a pathology 

CC related to a GPR abnormality e,g, a psychotic disorder such as 

CC schizophrenia, 

SQ Sequence 349 AA; 

Query Match 4,5%; Score 89; DB 19; Length 349; 

Best Local Similarity 20,5%; Pred, No. 5.64e+01; 

Matches 18; Conservative 29; Mismatches 38; indels 3; Gaps 3; 

Db 94 aiavdrhqvimplkprisitkgviyiaviwvmtffslphaicqklftfkysedivrslcl 153 

:| I: ; I:; : II; :; I :|::| : || : :: :;: : :;: : 
Qy 125 GIEFDWKYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFL-SVYKAFVM 183 

Db 154 dpfpepadl-fwkyldiatfillyllpl 180 

: I Mil:;: ll:| 
Qy 184 ETFVHLCSLGSWARLDARA-WTGLLAL 210 

' I' 
RESULT 10 

ID R48750 standard; Protein; 349 AA. 

AC R48750; 

DT 07-JUN-1996 (first entry) 

DE G-protein coupled mouse glucocorticoid-induced receptor protein. 

KW G-protein coupled receptor; ligand binding assay; transmembrane domain; 

KW psychotic disorder; schizophrenia; dopamine; cAMP; adenosine; thrombin; 

KW muscarinic acetylcholine; adrenergic; endothelin; bombesin; endocrine; 

KW rhodopsin; opsin; odorant; cytomegalovirus, 

OS Mus musculus, 

PN WO9405695-A1, 

PD 17-MAR-1994. 

PF 09-SEP-1993; U08528, 

PR lO-SEP-1992; nS-943236. 

PA (UYNY ) UNIV NEW YORK STATE. 

PI Murphy RB, Schuster DI; 
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DR WPI; 94-101120/12, 

PT Polypeptides of G-coupled receptor proteins (GPRs) - useful for 

PT binding GPR ligands or modulating GPR "binding 

PS Disclosure; Page 125-126; 160pp; English, 

CC Proteins R48585-R48758 represent a range of G-protein coupled receptor 

CC proteins selected from cAMP, adenosine, muscarinic acetylcholine, 

CC adrenergic, thrombin, endothelin, bombesin, endocrine, rhodopsin, opsin, 

CC odorant, cytoniegaloviral and other G-protein coupled receptors. The 

CC- receptor proteins were used to design polypeptides, pref . based on the 

CC transmembrane domains, for use in G-protein coupled receptor ligand 

CC binding assays. The polypeptide fragments retain biological activity 

CC such as binding a GPR ligand or modulating GPR ligand binding to a GPR 

CC (see R48759-R48758, R50569-R50B07 and R89189-R89195 for examples of 

CC- polypeptide fragments). The polypeptide fragments can be used in 

CC compositions for treating subjects suffering from a pathology related to 

CC a GPR abnormality e.g, a psychotic disorder such as schizophrenia. 

SQ Sequence 349 AA; 

Query Match 4.6%; Score 89; DB 16; Length 349; 

Best Local Similarity 20.5%; Pred, No, 5.64e+01; 

Matches 18; Conservative 29; Mismatches 38; Indels 3; Gaps 3; 

Db . 94 aiavdrhqviniplkprisitkgviyiaviwvmtffslphaicqklftfkysedivrslcl 153 
:| I: : h; : II; :: I :|:;| : II : ;; ;:; : ;;; : ■ 
Qy 125 GIEFDWKYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFL-SVYKAFVM 183 

Db 154 dpfpepadl-fwkyldiatfillyllpl 180 

: I Mil: ||.:.| - 

Qy 184 ETFVHLCSLGSWARLDARA-WTGLLAL 210 



RESOLT 11 

ID R21412 standard; Protein; 174 AA. 

AC R21412; 

DT Ol-APR-1992 (first entry) 

DE NADH dehydrogenase 6 , 

RW Pneumonia; assay; AIDS; immunosuppressed. ' 

OS Pneumocystis carinii, 

PN WO9119005-A. 

PD 12-DEC-1991. 

PF 31-MAY-1991; G00869. 

PR Ol-JON-1990; 6B-012196, 

PA (ISIS-) ISIS INNOVATION LTD. 

PI JJakefield ae, Hopkin JM, Moxon ER; 

DR WPI; 92-007487/01, 

DR N-PSDB; Q20065. 

PT New DNA sequences which act as oligonucleotide primers - for 

PT assaying DNA sample from respiratory secretion of a patient 

PT infected with P carinii 

PS Claim 3; Fig 3; 42pp; English. 

CC The amino acid sequence is that of P, carinii nadh dehydrogenase 

CC 6 which was translated from DNA from pIasmid'pAZ112. See also 

CC R21409-R21413 and R20056. 

SQ Sequence 174 AA; 

Query Match 4.5%; Score 87; DB 3; Length 174; 

Best Local Similarity 31,0%; Pred, No. 7.70e+01; 

Matches 13; Conservative 14; Mismatches 14; Indels 1; Gaps 1; 

Db 99 sslvftlylslpreiimekyshlfsyfswenrilnpsvveil 140 

::::: "Ih : ::|l ; III II |; ;|| I 
Qy 167 PAVLLLMFLSVYKAFVMETFVHLCSLGSWA-RLDARAWTGL 207 



RESULT 12 

ID W14481 standard; Protein; 293 AA. 

AC W14481; 

DT 28 -MAY- 1997 (first entry) 

DE Ranp-1. 

KW ranp-1; 5' -nucleotidase consensus sequence; nervous damage; 5'-AMP; 

KW treat; improved activity; organ; 5' -adenosine monophosphate, 

OS Rattus rattus, 

PN J09040697-A. 



PD lO-FEB-1997, 

' PF 27-^DL-1995; 192082, 

PR 27-JOL-1995; JP-192082, 

PA (SHIO ) SHIONOGI S CO LTD. 

DR WPI; 97-175708/16. 

DR H-PSDB; T62693, 

PT Protein containing 5 '-nucleotidase consensus sequence which 

PT specifically binds AMP - is useful for treatment of nervous diseases 

PS Claim 2; Page 10-11; 13pp; Japanese, 

CC The sequence is that of ranp-1 which contains a 5 '-nucleotidase consensus 

CC sequence. The DNA expression is enhanced when there is nervous damage. 

CC The protein enhances 5'-niicleotidase activity and binds specifically 

CC with 5' -AMP. Nervous diseases can be treated and the activity of various 

CC organs can be improved by controlling the expression of the DNA and the 

CC protein , 

SQ Sequence 293 AA; 

Query Match 4,5%; Score 87; DB 21; Length 293; 

Best Local Similarity 26,2%; Pred. No, 7.70e+01; 

Matches 16; Conservative 17; Mismatches 21; Indels 7; Gaps 5; 

Db 64 Ifslpg-flfqfipfmrkykiqkdkpetfegqwkclkgilfnhffiqlplicgtyyftef 122 

II ; I : :;|: II :| ::: III; ; ; ;|;|| I ;:: | 
Qy 3 LFHFGNCFALAYFPYFITYKC-SGLSE-YNAFWKCVQAGV-TYLFVQL--CKMLFLATF 56 



Db 


123 f 123 






..Qy... 


1 

-.57 F 57 






RESOLT 13 






ID 


R79929 standard; Protein; 417 AA. 




AC 


R79929; 








09-MAY-1995 (first entry) 




DE 


Porcine acylglucosamine-2-epimerase mutant. 




KW 


Porcine; acylglucosamine-2-epimerase; N-acetylmannosainine; 


KW 


N-acetylneuraminic acic 


; renin -binding; enzymatic production; 


KW 


mutant. 






OS 


Sus scrofa. 






FH 


Key Location/Qualifiers 




FT 


raiscjifference 23 






FT 


/note- 


"wild type Met substd. 


with Val" 


FT 


misc.difference 27 




FT 


/note- 


"wild type Leu substd. 


with Met" 


FT 


miscjifference 33 






FT 


/note- 


"wild type Arg substd. 


with Gin" 


FT 


miscjifference 45 




FT 


/note- 


"wild type Asp substd. 


with Glu" 


FT 


raisc_difference 71 




FT 


/note- 


"wild type Lys substd. 


with Thr" 


FT 


misc_difference 72 




FT 


/note- 


"wild type Leu substd. 


with Phe" 


FT 


miscjifference 76 




■ft' 


/note- 


"wild type His substd. 


with Arg" 


FT 


miscjifference 77 • 




FT 


/note- 


"wild type Arg substd, 


with His" 


FT 


misc.difference 78 




FT 


/note- 


"wild type Pro substd. 


with Ala" 


FT 


miscjifference 79 




FT 


/note- 


"wild type Glu substd. 


with Gin" 


FT 


misc.difference 94 • 




FT 


/note- 


"wild type His substd. 


with Tyr" 


FT 


misc.difference 101 • 


■ FT 


/note- 


"wild type Glu substd, 


with Gly" 


FT 


raiscjifference 120 


FT 


/note- 


"wild type Ser Substd. 


with Thr" 


FT 


miscjifference 137 




FT 


/note- 


"wild type Val substd. 


with Ala" 


FT ' 


" miscjifference 139 






FT 


/note- 


"wild type Ala substd, 


with Gly" 


FT . 


raiscjifference 141 


FT 


/note- 


"wild type Ala substd. 


with Val" 


FT 


miscjifference 145 
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"wild 


type 


Ser substd. 


witn inr 


FT 


m sc_a 


Lisrsncs 14!; 














"wild 


type 


Asp SUDStd. 


witn \3iu 


FT 


niisc 'di 


ffsrcncs 159 






FT 






"wild 


type 


ni^ oUJJoLU. 


WlLll 


FT 


niisc dj 












FT 






"wild 


type 


rlU OUUOLU. 


WiLll flia 


FT 


inisc di 


Eferonce 171 










FT 






"wild 


type 


Pro substd. 


WiLIl uill 


FT 


- 


ttcr6nc6 1/4 














"wild 


type 


Val SuDStQ. 


with Pro 




, ,, 
iniscji 


Ll6rGnC6 17d 












/notS" 


wild 


type 


Ser substd. 


with Ala 




niiscjj 


Eference 178 








rm 




/note= 


wild 


type 


Ser substd. 


with Pro" 


pi 


iiilsc_dj 


Eferencfi 187 








rl 




/note" 


wild 


type 


Cys substd. 


with Asn" 




iiiisc_dj 


Eference 195 






FT 




/note* 


wild 




Glu substd. 


with Ala" 




miscji 


Eferencs 202 










FT 


' ,, 




wild 


type 


Arg substd. 


with Lys" 




misc.ai 


Lter€Dce 205 












/note" 


wild 


type 


Gin substd. 


with Glu" 


11 


iiiisc_dj 


Eference 208 








n! 




/note" 


"wild 


type 


His substd. 


with Asp" 


FT 


'inisc_d] 


Eference 232 










/note" 


"wild 


type 


Asp substd. 


with Gly" 


PT 


miscji 


Eference 234 




PT 




/note° 


wild 


type 


Glu substd. 


with Lys" 


FT 


niisc_aJ 


Eference 237 












wild 


type 


Ser substd. 


with Pro" 


FT 


- 


Lierence 24 J 








FT 






"wild 


type 


□la sUDaLU. 


with Gin" 


FT 


wise di 
















"wild 


type 


Aid SluDStCli 


with Thr" 


FT 


- 












FT 






"wild 


type 


Ser substd. 


with cys 


FT 


misc d^ 


fference 251 






FT 






„ 


type 


ser suDstu. 


with He" 




_ 










FT 






„ ... 


type 


Ser substd. 


with Lys 


FT 




Lierence 2bb 








FT 




ff {c?^^" 


" "1/1 
wild 


type 


Ala substd. 


with Pro 




iHlSCjl 


tierence 267 








PT 




/note' 


wild 


type 


Lys substd. 


with Glu" 


pi 


niisc_di 


Eference 275 










/note" 


wild 


type 


Thr substd. 


with Lys" 




niisc_d] 


Eference 282 








PT 




/note" 


wild 


type 


Arg substd. 


with His" 


pm 


misc.di 


fference 287 






FT 




/note" 


"wild 


type 


Ala substd. 


with Pro" 




niisc_di 


fference 300 








PT 




/note" 


"wild 


type 


Gly substd. 


with Asn" 


PT 


- 


Eference 301 






PT 
pl 




/note" 


wild 


type 


Leu substd. 


with Phe" 




iiiisc_di 


fference 329 








PT 
pi 




/note" 


"wild 


type 


Glu substd. 


with Asp" 




niisc_d] 


fference 334 








FT 
pi 






wild 


type 


Ala substd. 


with Val" 




nilsc_di 


fference 363 








p 




/note" 


"wild 


type 


Asn substd. 


with Ser" 


FT 


niisc_di 


fference 371 








FT 






"wild 


type 


Thr substd. 


with Ser" 




- 


fference 395 










•FT 




/note" 


"wild 


type 


Leu substd. 


with Pro" 


FT 


miscji 


fference 395 








FT 




/note" 


"wild 


type 


Ser substd. 


with Gly" 


FT 


misc.di 


fference 401 






FT 




/note" 


"wild 


type 


Leu substd. 


with Pro" 


FT 


miscji 


fference 403.. 41 








FT 




/note" 


"C-terminal addition to the wild type 


FT 




sequence" 








PN 


W095263 


99-Al. 











05-OCT-1995. 

2L-MAR:1995; J00541. r,-- 

25-MAR-1994; JP-056271. 
09-SEP-1994; JP-216333. 
(MARO ) MARUKIN SHOYO KK. 
Maru I, Ohta Y, Tsukada Y; 
WPI; 95-351320/45. 

Recombinant acyl:glucosaniine-2-epimerase with renin-binding activity 
■ useful in enzyniic production of N-acetyl:mannosaraine and 
N-acetyl;neuraminic acid 
Claim 13; Pages 59-61; 74pp; Japanese. 
R79929 is a porcine acylglucosamine-2-epimerase (A2P) mutant, 
with renin-binding activity. A2P can be used for the enzymatic 
prodn. of N-acetylmannosamine and N-acetylneuraminic acid, 
417 AA; 



Query Match 4.5%; Score 07; DB 15; Length 417; 

Best Local Similarity 23,8%; Pred. No. 7.70e+01; 

Matches 15; Conservative 18; Mismatches 26; Indels 4; Gaps 4 

Db 19 Idrvvafwmehshdqehggfftclgregrvyddlkyvwlqgrqvwiycrlyrtferfrha 78 

: I ;::|; ;: I |; : ; | | : |{ | 

Qy 113 MSRCIPLWVG-ARGIEFDWKYIQMSIDSNI-SLVHYI-VASAQVWMITR-YDLYHNFRPA 168 

Db 79 qll 81 

II 

Qy 169 VLL 171 



ID P70429 Standard; protein; 558 AA, 

AC P70429; 

DI 16-JAN-1991 (first entry) 

DE Murine neuroleukin. 

KW Neuroleukin; neural cells; HIV; AIDS; env gene; vaccine. 

OS Mus musculus. 

FH Key Location/Qualifiers 

FT' region 1..436 

FT /label-first exon 

FT region 437.. 0 

FT /label°second exon 

PN WO8707517-A. 

PD 17-DEC-1987. 

PF 04-JON-1987; O01323. 

PR 05-JtJN-1986; US-872332. 

PA (GENE-) Genetics Institute Inc. . 

PI Gurney ME, Knopf JL; 

DR WPI; 87-362709/51. 

DR N-PSDB; N70686. 

PT New neuroleukin protein • useful for culturing neural cells, and for, 

PT treating human iramunodeficiency virus. 

PS Claim 1; page 22; 29pp; English. 

CC Neuroleukin is a single polypeptide chain of apparent mol. wt. 56000 V 

CC 2000 D (by SDS-PAGE), with the ability to maintain 1/2 maximal survival 

CC of spinal or sensory neurons cultured in vitro at a neuroleukin concn|.' of 

CC 0,00000000125 M, and the ability to activate immunoglobulin secretion by 

CC peripheral blood Ipphocytes, Neuroleukin is useful in tissue culture 

CC medium for culturing neural cells and extends the survival of sensory 

CC 'ganglia, brain cells and spinal neurons in culture. A portion of the DNA 

CC sequence has significant homology to portion of the HTLV III/LAV envelope 

CC protein gene, thus may be useful to treat AIDS, or in an AIDS 

CC vaccine. Its treatment of HIV infection is claimed. The protein 

CC is encoded by 2 exons. 

SO Sequence 558 AA; 

Query Match 4,5%; Score 88; DB 2; Length 558; 

Best Local Similarity 29.0%; Pred, No. 6,59e+01; 

Matches 18; Conservative 18; Mismatches 20; Indels 5; Gaps 5; 

Db 269 wggr-ysl-wsaiglsialhvgfdhfeqllsgahwmdqhf-lktpleknapvUallgi 325 

lll;l I I III |; : |; || | : ; | ;|| ;|;: 

Qy 120 WVGARGIEFDWKYIQMSIDSNISLVHYI-VASAQVWMITRYDLYHNF-RPAVLLLMFLSV 177 
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Db 326 wy 327 

I 

Qy 178 -Y 178 



RESULT 15 

ID W48271 Standard; Protein; 152 AA. 

AC W4B271; 

DT 23-JUN-1998 (first entry) 

DE Rat ninjurin 1, 

KW Rat; ninjurin; cellular adhesion molecule; membrane bound; tumour; 

KW nerve injury induced gene; inflammation; nervous system, 

OS Rattus sp, 

P.N WO9803650-A1. 

PD 29-JAN-1998, 

PF 24-JDL-1997; D12210. 

PR 24-jaL-1996; US-672850, 

PA (ONIW ) DNIV WASHINGTON. 

PI Araki T, Milbrandt J; 

DR WPI; 98-120775/11. 

DR N-PSDB; V20565. 

PT New isolated nerve injury induced (ninjurin) gene ■ used to develop 

PT products for treating conditions involving excessive or insufficient 

PT cellular adhesion, e.g, inflainmation or tumours 

PS Claim 5; Fig IB; BOpp; English. 

CC The present sequence represents rat ninjurin 1 (nerve injury induced), 

CC The ninjurin protein (NP) plays a role in axonal regeneration of 

CC peripheral nervous system -.(PNS).. neuronal, cells- after- injury .■- The 

CC products can be used for developing products for treating ninjurin 

CC mediated disorders including conditions involving inappropriate (i.e, 

CC excessive or insufficient) cellular adhesion. Conditions involving 

CC excessive cellular adhesion which may be treated include e.g. 

CC inflammatory diseases such as rheumatoid arthritis, asthma, allergy 

CC conditions, adult respiratory distress syndrome, inflammatory bowel 

CC diseases (e.g. Crohn's disease, ulcerative colitis and regional 

CC enteritis) and ophthalmic inflammatory diseases, autoimmune diseases, 

CC thrombosis or inappropriate platelet aggregation conditions, 

CC arteriosclerosis, reocclusion following thrombolysis, cardiovascular 

CC diseases, some forms of diabetes and neoplastic disease including 

CC metastasis conditions. The products can also be used to promote 

CC cellular adhesion, e.g. in nerve regeneration, wound healing or 

CC prosthetic implantation. The products can also be used for detection, 

CC purification, diagnosis and screening assays. 

SQ _ Sequence 152 AA; 

Query Match 4.4»; Score 85; DB 29; Length 152; 

Best Local Similarity 27, 6»; Pred. No. 1.05e+02; 

Matches 21; Conservative 19; Mismatches 31; Indels 5; Gaps 5; 

Db ' 56 llmanasqlkaweqgnefaf-fvplvvlisislvlqigvgvUiflvkydlnnpakha 113 

I I :| II : ;|||| l|; ; :; ;||| ; : | 

Qy 111 LIMSRCIPLW-VGARGIEFDWKnQMSIDSNISLVHYI-VASAQVWMITRYDLYHNFRPA 168 

Db 114 kldflnnlatglvfii 129 • ■■ 

I :| I; I;: ' ■ 
Qy 169 VL-LLMFLSVYKAFVM 183 ' 



Search completed: Fri Jan 22 17:59:45 1999 
Job time : 84 sees. 
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Release 3.1A John F. Collins, Biocomputing Research Dnit. 
Copyright (c) 1993-1998 University of Edinburgh, U,K, 
Distribution rights by Oxford Molecular Ltd 

MPsrchj)p protein • protein database search, using Smith-Waternian algorithm 

Run on: Fri Jan 22 18:01:16 1999; Malpar Time 15^58' M 

782,839 Million cell updates/sec 

Tabular output not generated, 



Title: 

■Description: 
Perfect Score: 
Seque:)ce: 

Scoring table; 



Searched: 



>US-09-162'597-3 
(1-245) from aS09162597,pep 
1940 

1 MTLFHFGNCFALAYFPyFIT GLVSQILMYLFPASLQVLVK 245 

PAM 150 
Gap 11 

165420 seqs, 49795644 residues 

Post-processing: Minimum Match 0% 

Listing first 45 summaries 

Database: sptrembl6 

l:sp_archea 2:sp_bacteria 3:sp_fungi 4:sp_human 
5:sp_invertebrate 6;spjiairanal 7;spjihc 8:sp_organelle 
9:sp_phage 10:sp_plant ll:sp_rodent 12:sp_unclassified 
13;sp_vertebrate 14:sp_virus 

Statistics; Mean 46.110; Variance 101.374; scale 0,455 

Pred, No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution, 



% 

Query 



No, Score 


Match Length 


DB 


ID 


Description 


Pred. No. 


1 636 


32.8 


216 


5 


Q23483 


COSMID ZK418, 


2.66e-90 


2 119 


6.1 


1087 


5 


046089 


1-BVIDENCE»PREDICTED B 


1.50e-02 


3 113 


5.8 


215 


8 


Q36712 


PETB, PETD (RNA EDITIN 


8.40e-02 


4 113 


5.8 


232 


8 


Q32438 


POT, ALT, PETB GENE PR 


8.40e-02 


• 5 113 


5.8 


232 


8 


Q33298 


ALTERNATE PETB GENE PR 


8,40e-02 


6 113 


5,8 


232 


8 


Q36615 


APOCYTOCHROME B6. 


8.40e-02 


7 109 


5.6 


300 


2 


032730 


ATTA2 PROTEIN. 


2.58e-01 


8 106 


5.5 


271 


8 


063292 


NADH DEHYDROGENASE SUB 


5.88e-01 


9 105 


5.4 


212 


1 


027297 


CONSERVED PROTEIN. 


7.72e-01 


iO 104 


5.4 


310 


2 


051297 


CONSERVED HYPOTHETICAL 


l.Ole+00 


11 104 


5.4 


379 


8 


047977 


CYTOCHROME B, 


l.Ole+00 


12 105 


5.4 


527 10 


064515 


YOP8H12R.2 PROTEIN, 


7,72e-01 


13 104 


5.4 


1410 


5 


Q18259 


SIMILARITY ALONG ENTIR 


l.Ole+OO 


14 103 


5.3 


308 


5 


017615 


C29F3,6. 


1.32e+00 


15 102 


5.3 


379 


8 


047963 


CYTOCHROME B. 


1.73e+00 


16 102 


5.3 


379 


8 


047962 


CYTOCHROME B, 


1.73e+00 


17 102 


5.3 


380 


8 


048000 


CYTOCHROME B (FRAGMENT 


1.73e+00 


18 102 


5.3 


503 


8 


Q37612 


CYTOCHROME C OXIDASE S 


1.73e+00 


19 101 


5.2 


204 


8 


047043 


CYTOCHROME B (FRAGMENT 


2.25e+00 


20 101 


5.2 


348 


8 


047403 


NADH DEHYDROGENASE SUB 


2.25e+00 



21 100 


5.2 


348 8 


04/jyy 


NADH DEHYDROGENASE SUB 


^j^e+uu 


,-.22--.ra»100- - 


. 5.2 


■348 •■ 8- 


04>7398t-^-- 


■ NADH DEHYDROGENASE SUB 


^.y/e+uu 


23 101 


5.2 


379 8 


047991 


CYTOCHROME B. 


i.zue+uu 


24 101 


5,2 


379 B 


047965 


CYTOCHROME B. 


2.25e+00 




W 


379 8 


047992 


CYTOCHROME B. 


^.zoe+uu 


26 101 


5.2 


379 8 


047972 


CYTOCHROME B. 


2.25e+u0 




5.2 


379 8 


047966 


CYTOCHROME B, 


2.25e+00 


28 101 




379 8 


047974 


CYTOCHROME B. 


2.2De+uu 


29 101 








LiiUtnKUMt; b. 


^ .zje+uu 






379 8 




LnULHKUMil, D. 


z.y^e+uu 


31 101 








CYTOCHROME B. 


2.25e+00 


32 101 


5,2 


380 8 


047976 


CYTOCHROME B. 


2.25e+00 


33 101 


5,2 


380 8 


047964 


CYTOCHROME B. 


2.25e+00 


1 34 101 


5,2 


380 8 


047990 


CYTOCHROME B. 


2.25e+00 


35 101 


5,2 


380 8 


04/y/i 


CYTOCHROME B. 


2.25e+00 


36 101 


5,2 


495 8 


Q34463 


CYTOCHROME OXIDASE SUB 


2.25e+00 


37 101 


5,2 


550 2 


031692 


rnUSrnUiKAWSt&KA&Il olo 




38 101 


5,2 


573 2 


069251 


ENZYME I, 


2.25e+00 


.,..^.1™ 3S_.._,..99.,„. 


..5.,1„, 


.133„..8.. 


.04.7442..„.. 


..CYTOCHROME B (FRAGMENT 


3.80e+00 


40 99 


5,1 


247 1 


050081 


247AA LONG HYPOTHETICA 


3.80e+00 


41 99 


5,1 


345 5 


Q20963 


RELATED TO C. ELEGANS 


3.80e+00 


42 99 


5,1 


363 1 


026434 


PEROSAMINE SYNTHETASE, 


3.80e+00 


43 99 


5,1 


379 8 


048001 


CYTOCHROME B. 


3.80e+00 


44 99 


5,1 


379 8 


048364 


CYTOCHROME B. 


3.80e+00 


45 99 


5,1 


2206 5 


P91744 


VOLTAGE-DEPENDENT CALC 


3.80e+00 



ALIGNMENTS 



1 



PRELIMINARY; 



PRT; 216 AA. 



RESULT 

ID Q23483 

AC Q23483; 

DT Ol-NOV-1996 (TREMBLREL. 01, CREATED) 

DT Ol-NOV-1996 (TREMBLREL. 01, LAST SEQUENCE UPDATE) 

DT Ol-NOV-1996 (TREMBLREL. 01, LAST ANNOTATION UPDATE) 

DE COSMID ZK418. 

GN ZK418.5. 

OS CAENORHABDITIS ELEGANS. 

OC EOKARYOTA; METAZOA; ACOELOMATES; NEMATODA; SECERNENTEA; RHABDITIDA. 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-BRISTOL N2; 

RX MEDLINE; 94150718. 

RA WILSON R,, AINSCOOGH R., ANDERSON K., BAYNES C, BERKS M., BONFIELD J., 

RA BURTON J,, CONNELL M., COPSEY T., COOPER J,, COULSON A,, CRAXTON M,, 

RA DEAR S,, DO Z,, DURBIN R,, FAVELLO A., FULTON L., GARDNER A., GREEN P., 

RA HAWKINS T., HILLIER L,, JIER M., JOHNSTON L., JONES M., KERSHAW J., 

RA RIRSTEN J., LAISTER N., LATREILLE P., LIGHTNING J., LLOYD C, 

RA MCMURRAY A,, MORTIMORE B,, O'CALLAGHAN M,, PARSONS J., PERCY C, 

RA RIFKEN L,, ROOPRA A., SAUNDERS D,, SHOWNKEEN R,, SMALDON N,, SMITH A,, 

RA SONNHAMMER E,, STADEN R,, SDLSTON J., THIERRY -MIEG J., THOMAS K., 

RA VAUDIN M,, VAUGHAN K., WATERSTON R., WATSON A., WEINSTOCK L., 

RA WILKINSON-SPROAT J., WOHLDMAN P.; 

RL NATURE 368:32-38(1994). 

""" [2] I 

RP SEQUENCE FROM N.A. 

RC STRAIN-BRISTOL N2; 

, RA FULTON L,; 
RL . SUBMITTED (APR-1994) TO EMBL/GENBANK/DDBJ DATA BANKS, 

RN [3] 

RP SEQUENCE FROM N,A. 

RC STRAIN=BRISTOL N2; 

RA WATERSTON R.; 

RL SUBMITTED (APR-1994) TO EMBL/GENBANK/DDBJ DATA BANKS. 

DR EMBL; U00047; G470373; -, 

SQ SEQUENCE 216 AA; 23931 MW; 8B1C508A CRC32; 

Query Match 32.8*; Score 636; DB 5; Length 216; 

Best Local Similarity 47.8*; Pred, No, 2,66e-90; 

Matches 76; Conservative 41; Mismatches 41; Indels 1; Gaps 1 

Db 1 MSFFHFINCFALAFAPYFIVYKYSGINEYSSIWKCATASGGYLLTQLAKLLIIATFFPAL 60 
I::||| lllllh nil II l|::||:::||| |: II: || |:|:;|||||: 



Best Available Copy 

Sun Jan 24 11:29:04 1999 



US-09-162-597-3.rspt 



Page 2 



Qy 1 MTLFHFGNCFALAYFPYFITyKCSGLSEYNAFWKCVOAGVTYLFVQLCKMLFLATFFPTW 60 
Db 61 DSEGFSIVPEFLKSSADIIDVIGLHLLMTNFLAGKGEVRFWGGLGWGFAHSVAHRLVLL 12o" 

; ;: Ihhl h hllhhh Mill ::;|;:|||: I : I ; I 

Qy 61 EGGIYDFIGEFMKASVDVADLIGLNLVMSRN-AGKGEYKIMVAALGWATAELIMSRCIPL 119 
Db _ 121 WVGARGTAFTWRWVQTSLDSSADLLVIVSLACLTWMITR 159 



Qy 120 WVGARGIEFDWKYIQMSIDSNISLVBYIVASAQVWMITR 158 



RESPLT 2 

ID 046089 PRELIMINARY; PRT; 1087 AA. 

AC 046089; 

DT Ol-JUN-1998 (TREMBLREL, 06, CREATED) 

DT Ol-JUN-1998 (TREMBLREL. 06, LAST SEQUENCE UPDATE) 

DT Ol-JUN-1998 (TREMBLREL, 06, LAST ANNOTATION UPDATE) 

DE 1 -EVIDENCE-PREDICTED BY MATCH. 

GN EG:63B12.9. 

OS DROSOPHILA MELANOGASTER (FROIT FLY). 

OC EORARYOTA; METAZOA; ARTHROPODA; INSECTA; DIPTERA. 

RN [1] 

RP SEQUENCE FROM N.A. 

RA FERRAZ C, VIDAL S., BRUN C, BUCHETON A., DEMAILLE J.G.; 

RL SUBMITTED (JAN-1998) TO EMBL/GENBANK/DDBJ DATA BANKS. 

RN [2] 

RP SEQUENCE FROM N.A, 

RA BENOSP,; 

RL SUBMITTED (FEB-1998) TO EMBL/GENBANK/DDBJ DATA BANKS. 

DR EMBL; AL021106; E1249577; •. 

SQ SEQUENCE 1087 AA; 123655 MW; C4CA9980 CRC32; 

Query Match 6,U; Score 119; DBS; Length 1087; 

Best Local Similarity 28.6%; Pred, No. l,50e-02; 

Matches 16; Conservative 18; Mismatches 19; Indels 3; Gaps 3; 

Db 266 LLLKRLIPKWLGERELDF-WLFHKAMQRHKSL-HFQGESNAISLAYH-ELYSWLR 318 

h: I II 1:1 I :;l I :::::: : || |: I ; ; : ;|| :| 
Qy 111 LIMSRCIPLWVGAR6IEFDWKYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFR 165 



RESULT 3 

ID Q36712 PRELIMINARY; PRT; 215 AA, 

AC 036712; 

DT Ol-NOV-1996 (TREMBLREL. 01, CREATED) 

DT Ol-NOV-1996 (TREMBLREL. 01, LAST SEQUENCE UPDATE) 

DT Ol-JUN-1998 (TREMBLREL. 06, LAST ANNOTATION UPDATE) 

DE PETB, PETD (RNA EDITING] , 

GN PETB. 

OS ZEA SP. 

OG CHLOROPLAST, 

OC EUKARYOTA; PLANTAE; EMBRYOBIONTA; MAGNOLIOPHYTA; LILIOPSIDA; 

OC COMMELINIDAE; CYPERALES; POACEAE, 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE; 94073219. 

RA FREYER R., HOCH B., NECKERMANN K., MAIER R.M., KOSSEL H,; 

RL PLANT J. 4:621-629(1993). 

CC -!- CATALYTIC ACTIVITY: QH(2) + 2 FERRIC YTOCHROME C - Q + 2 
CC FERROCYTOCHROME C. 

CC -!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 

CC BOUND TO THE PROTEIN (BY SIMILARITY). 

DR EMBL; S67283; G453127; -, 

DR EMBL; S67282; G453127; JOINED. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1. 

DR PFAM; PF00033; cytochromejj." 

KW CHLOROPLAST; ELECTRON TRANSPORT; RESPIRATORY CHAIN; TRANSMEMBRANE; 

KW HEME. 

SQ SEQUENCE 215 AA; 24195 MW; C2B9E74A CRC32; 

Query Match 5.84; Score 113; DB 8; Length 215; 

Best Local Similarity 23. 5»; Pred, No. 8.40e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6; 



Db 57jfYRPTVTEAFSSVQYIMTEANFGWLI -R- SV-HRWSASMMVLMMILHVFRVYLTGGFKKP- 113 - 

I : :| Ml;; I hi I : I :;;; |;|:| |:; :: I 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 114 REL-TWVTGWLAVLTASFGVTGYS 137 

I :l ll:|: ;;; ;; 
Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 4 

ID Q32438 PRELIMINARY; PRT; 232 AA. 

, ■ AC Q32438; 

' DT Ol-NOV-1996 (TREMBLREL. 01, CREATED) 

„' DT Ol-NOV-1996 (TREMBLREL. 01, LAST SEQUENCE UPDATE) 

I DT Ol-JUN-1998 (TREMBLREL. 06, LAST ANNOTATION UPDATE) 

DE POT, ALT. PETB GENE PRODUCT. 

OS HORDEUM VULGARE (BARLEY) . 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE; 

OC CYPERALES; GRAMINEAE. 

RN [1] 

RP SEQUENCE FROM N.A. 

I RC STRAIN=CV, SABARLIS; 

RX MEDLINE; 89240047, 

RA REVERDATTO S , V , , ANDREEVA A . V . , BURYAKOVA A . A . , CHAKHMAKHCHEVA 0 . G . , 

RA EFIMOVV.A,; 

RL NUCLEIC ACIDS RES. 17:2859-2860(1989). . - — 

CC •!- CATALYTIC ACTIVITY; QH(2) + 2 FERRICYTOCHROME C =■ Q + 2 
CC FERROCYTOCHROME C, 

CC ■!■ COFACTOR: TWO HEME GROUPS (B552 AND B566) WHICH ARE NOT COVALENTLY 

CC BOUND TO THE PROTEIN (BY SIMILARITY). 

DR EMBL; X14107; G11596; -, 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 

DR PFAM; PF00033; cytochromeJj . 

KW CHLOROPLAST; ELECTRON TRANSPORT; RESPIRATORY CHAIN; TRANSMEMBRANE; 

KW HEME. 

SO SEQUENCE 232 AA; 25078 MW; F06F9FF4 CRC32; 

Query Match 5.8%; Score 113; DBS; Length 232; 

Best Local Similarity 23,5%; Pred, No, 8,40e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 5; Gaps 6; 

Db 74 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 130 

I ; ;: :| I;||;: I hi I : I :;:: hhl h; ;; | 
Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 131 REL-TWVTGWLAVLTASFGVTGYS 154 

. I ;| Ihl; ;;; ;: 

Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



.,y' RESULT 5 

•'"'"ID Q33298 PRELIMINARY; PRT; 232 AA. 

' AC Q33298; 

DT Ol-NOV-1996 (TREMBLREL. 01, CREATED) 

DT Ol-NOV-1996 (TREMBLREL, 01, LAST SEQUENCE UPDATE) 

DT Ol-JUN-1998 (TREMBLREL. 06, LAST ANNOTATION UPDATE) 

DE' ALTERNATE PETB GENE PRODUCT. 

' GN PETB, 

OS ZEA MAYS (MAIZE) . 

OG CHLOROPLAST. ' ' i ' 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE; 

OC CYPERALES; GRAMINEAE. 

RN [1] ' ■ . 

RP SEQUENCE FROM N.A, 

RX MEDLINE; 88210525, 

RA ROCK CD,; BARKAN A., TAYLOR W,'C,; 

RL CURR. GENET, 12:69-77(1987). " 

CC -!- CATALYTIC ACTIVITY: QH(2) + 2 FERRICYTOCHROME C - Q + 2 
CC' FERROCYTOCHROME C. ' 

CC -!- COFACTOR: TWO HEME GROUPS (B562 AND B565) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN (BY SIMILARITY). 
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DR EMBL; X05422; G12437; •. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; L„ 

OR PFAM; PF00033; cytochrome Jj. 

RW CHLOROPLAST; ELECTRON TRANSPORT; RESPIRATORY CHAIN; TRANSMEMBRANE; 

KW HEME. 

SO SEQUENCE 232 AA; 25964 MW; 8B742506 CRC32; 

Query Match ■ 5.8%; Score 113; DB 8; Length 232; 

Best Local Similarity 23,5*; Pred. No. 8.40e-02; 

Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6; 

Db 74 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 130 

I ; :l I |:| I ; I |:|:| |:; ;: I 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

Db 131 REL-TMVTGWLAVLTASFGVTGYS 154 

Qy 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 6 

ID 036515 PRELIMINARY; PRT; 232 AA. 

AC Q36615; 

DT Ol-NOV-1996 (TREMBLREL. 01, CREATED) 

DT Ol-NOV-1996 (TREMBLREL. 01, LAST SEQUENCE UPDATE) 

DT Ol-JDN-1998 (TREMBLREL. 06, LAST ANNOTATION UPDATE) 

DE APOCYTOCHROME B6. 

GN PETB. 

OS ORYZA SATIVA (RICE). 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE; 

OC CYPERALES; GRAMINEAE. 

RN [1] 

RP SEQUENCE FROM N.A. 

RA COTE J.C, WU N.H., WU R.; 

RL PLANT MOL, BIOL. 11;873-874(1988) . ' 

CC ■!• CATALYTIC ACTIVITY; QH(2) + 2 FERRICYTOCHROME C - Q + 2 
CC FERROCYTOCHROME C. 

CC -!■ COFACTOR; TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 
CC BOUND TO THE PROTEIN (BY SIMILARITY). 

DR EMBL; M35995; G343210; -. 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1. 

DR PFAM; PF00033; cytochromejj. 

KW CHLOROPLAST; ELECTRON TRANSPORT; RESPIRATORY CHAIN; TRANSMEMBRANE; 

KW HEME, 

SQ SEQUENCE 232 AA; 26014 MW; 0864CFC6 CRC32; 

Query Match 5.8*; Score 113; DB 8; Length 232; 

Best Local Similarity 23,5*; Pred, No, 8,40e-02; 
Matches 20; Conservative 27; Mismatches 32; Indels 6; Gaps 6; 

Db 74 YYRPTVTEAFSSVQYIMTEANFGWLI-R-SV-HRWSASMMVLMMILHVFRVYLTGGFKKP 130 

I : :; :l I |:| I : I |:|:| |:; ;: | 

Qy 132 YIQMSIDSNISLVHYIVASAQV-WMITRYDLYHNFRPAVL-LLMFLSVYKAFVMETFVHL 189 

' I 

Db 131 REL-TWVTGWLAVLTASFGVTGYS 154 

I :l ::: :: 

Qy . 190 CSLGSWARLDARAWTGLLALKHFG 214 



RESULT 7 

ID 032730 PRELIMINARY; PRT; 300 AA. 

AC =032730; 

DT Ol-JAN-1998 (TREMBLREL. 05, CREATED) 

DT Ol-JAN-1998 (TREMBLREL. 05, LAST SEQUENCE UPDATE) 

■DT Ol-AUG-1998 (TREMBLREL. 07, LAST ANNOTATION UPDATE) 

DE ATTA2 PROTEIN. 

GN ATTA2. 

OS AGROBACTERIUM TUMEFACIENS. 

OC PROKARYOTA; GRACILICOIES; SCOTOBACIERIA; AEROBIC RODS AND COCCI; 

OC RHIZOBIACEAE. 

RN [1] 

RP SEQUENCE FROM N.A. 



RC STRAIN=C58; 

RX MEDLINE; 96359388. 

RA MATTHYSSE A.G., YARNALL H.A., YOUNG N.; 

RL J, BACTERIOL. 178:5302-5308(1996). 

DR EMBL; L63540; G2340071; -. 

DR PFAM; PF00528; BPDJransp, 

SO SEQUENCE 300 AA; 32289 MW; 2E642739 CRC32; 

Query Match 5,6*; Score 109; DB 2; Length 300; 

Best Local Similarity 25,7*; Pred. No. 2.58e-01; 

Matches 27; Conservative 28; Mismatches 41; Indels 9; Gaps 8; 

Db 114 LILTGCVLIPLWTSVLVRSYAWIVLLQRTGIINQALLSTKLIDEPLRMIYTQGAVIVAMT 173 

II:; I nil : : I :| : I :|: : ;;|| |: ; 
Qy 111 LIMSRC--IPLWVGARGIEFDWK-YIQMSIDSNISLVHYIVASAQVWMI-TRYDLYHNFR 156 

Db 174 HVLLPFMIMPIFAALRAIPPEYVQAAMNLGS-TRLHAFRAIIVPL 217 
:| :|::::: |: : :|: :|ll :|| I l|:: I 
,-Qy..._.,.:1.67_EAVLLMFLSyYKAF-VMET.-FVHLC:SlGSWARLDA:RAVVTGL.207 



RESULT 8 

ID 063292 PRELIMINARY; PRT; 271 AA. 

AC 063292; 

DT Ol-AOG-1998 (TREMBLREL. 07, CREATED) 

DT Ol-AOG-1998 (TREMBLREL. 07, LAST SEQUENCE UPDATE) 

DT Ol-AOG-1998 (TREMBLREL. 07, LAST ANNOTATION UPDATE) 

DE NADH DEHYDROGENASE SUBUNIT 5 (FRAGMENT). 

GN ND5. 

OS BYASA ALCINOOS. 

OG MITOCHONDRION. 

OC EUKARYOTA; METAZOA; ARTHROPODA; TRACHEATA; HEXAPODA; INSECTA; PTERYGOTA;. 

OC LEPIDOPTERA; UNCLASSIFIED DITRYSIA; BYASA. 

RN [1] 

RP SEQUENCE FROM N,A, 

RA YAGI I,, SASAKI G., TAKEBE H.; 

RL J. MOL. EVOL, 0:0-0(1998). 

DR EMBL; AB013145; D1029125; -, 

KW MITOCHONDRION. 

FT NONJER 1 1 

FT NONJER 271 271 

SQ SEQUENCE 271 AA; 31136 MW; 1CC996F1 CRC32; 

Query Match 5,5*; Score 105; DBS; Length 271; 

Best Local Similarity 34,4*; Pred, No. 5,88e-01; 

Matches 21; Conservative 11; Mismatches 27; Indels 2; Gaps 2; 

Db 195 YDEDYIMLKSMFMLLLMSLIVGSSLSWMILQYP-YMIFLPLSLKLMVIYVSLMGFLMGYM 253 

:| II ; : I: l|:|: III ;| I II I II : I ::|:| 

Qy 128 FDWKYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFLSV-YKAFVMETF 186 

Db 254 V 254 
I 

Qy 187 V 187 

. I i .(: 

RESULT 9 

ID 027297 PRELIMINARY; PRT; 212 AA. 

AC 027297; 

DT Ol-JAN-1998 (TREMBLREL. 05, CREATED) 

DT Ol-JAN-1998 (TREMBLREL. 05, LAST SEQUENCE UPDATE) 

DT Ol-ADG-1998 (TREMBLREL. 07, LAST ANNOTATION UPDATE) 

DE CONSERVED PROTEIN. 

GN MTH1229, 

^ OS METHANOBACTERIUM THERMOAUTOTROPHICUM. 

OC ARCHAEBACTERIA; EORYARCHAEOTA; METHANOBACTERIALES; 

OC HETHANOBACTERIACEAE. 

■RN II] 

RP SEQUENCE FROM N.A. 

RC STRAIN=DELTA H; 

RX MEDLINE; 98037514. 

RA SMITH D.R., DOOCEITE-STAMM L,A,, DELOUGHERY C, LEE H.-M., DUBOIS J., 

RA ALDREDGE T., BASHIRZADEH R., BLAKELY D., COOK R,, GILBERT K., 
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RA HARRISON D., HOANG L., KEAGLE P., LUMM W,, POTHIER B., Qltl D., 

RA SPADAFORA R,, VICARE R., WANG Y,, WIERZBOWSKI J., GIBSON R., 

RA JIWANI N.,CAROS0 A,, BCSH D., SAFER H., PATWELL D,, PRABHAKAR S., 

RA MCDODGALL S., SHIMER G., GOYAL A,, PIETROVSKI S., CHURCH GJ,, 

RA DANIELS C,J,, MAO J. -I., RICE P., NOLLING J., REEVE J.N.; 

RL J. BACTERIOL. 179:7135-7155(1997). 

DR . EMBL; AE000890; G2622340; •■ 

SQ ' SEQOENCE 212 AA; 23956 MW; 7EB119AC CRC32; 

Query Match 5.4%; Score 105; DB 1; Length 212; 

Best Local Similarity 31.5*; Pred. No. 7,72e-01; 

Matches 23; Conservative 17; Mismatches 27; Indels 6; Gaps 6; 

Db 44 AWALSELGIKGSSIIIWIGPMDVOIVDYEDVELEIRSEEVL-HFIVEHFDE-OPSSLRL 101 

:|| :ll li; I |:|; : ;: I :;: I |; I |:|| : ; I 
Oy 105 GWATAEL-IMSRCIPLWVGARGI-EF-DWKYIQMSIDSNISLVHYIVASAQVWMITRYDL 161 

Db 102 AYHRQRILVMLLM 114 

II I hill 
Qy 162 -YHNFRPAVLLLM 173 



RESDLT 10 

ID 051297 PRELIMINARY; PRT; 310 AA. 

AC 051297; _ _ .„ 

DT Ol-JUN-1998 (TREMBLREL. 06, CREATED) 

DT Ol-JaN-1998 (TREMBLREL. 06, LAST SEQUENCE UPDATE) 

DT Ol-AUG-1998 (TREMBLREL. 07, LAST ANNOTATION UPDATE) 

DE CONSERVED HYPOTHETICAL INTEGRAL MEMBRANE PROTEIN, 

GN BB0317, 

OS BORRELIA BURGDORFERI (LYME DISEASE SPIROCHETE) . 

OC PROKARYOTA; GRACILICDTES ; SCOTOBACTERIA; SPIROCHETES; SPIROCHAETALES; 

OC SPIROCHAETACEAE. 

RN [1] 

RP SEQUENCE FROM N,A. 

RC STRAIN-ATCC 35210 / B31; 

RX MEDLINE; 98065943. 

RA FRASER CM., CASJENS S., HUANG W.M,, SUTTON G,G., CLAYTON R.A., 

RA LATHIGRA R., WHITE 0,, KETCHUM K.A., DODSON R,, HICKEY E.K., GWIHN M,, 

RA DOUGHERTY B,, TOMB J,-F., FLEISCHMANN R,D,, RICHARDSON D., 

RA PETERSON J., KERLAVAGE A.R., QUACKENBUSH J., SALZBERG S., HANSON M., 

RA VAN VOGT R., PALMER N., ADAMS M,D., GOCAYNE J,D., WEIDMAN J., 

RA taRBACK T., WATTHEY L., MCDONALD L., ARTIACH P., BOWMAN C, 

RA GARLAND S., FUJII C, COTTON M,D., HORST K., ROBERTS K., HATCH B., 

RA SMITH H.O., VENTER J.C; 

RL NATURE 390:580-586(1997). 

DR ,EMBL; AE001138; 02688213; -, 

DR TIGR; BB0317; •. 

SQ SEQUENCE 310 AA; 35680- MW; C8F8F32F CRC32; 

Query Match 5.4%; ' Score 104; DB 2; Length 310; 

Best Local Similarity 28.0%; Pred. No. l.Ole+00; 

Matches 23; Conservative 22; Mismatches 34; Indels 3; Gaps 3; 

Db 229 WFFRPYLVLGLTSGLG-WSSLIVAVISGFNYVYVLFFSLLFSILIEFNNFLNINYDFKY 287 

; ;| II:: III; I :! |:;| ; ill : I ■ I.II • 

Oy 12 LAYF-PYFITYKCSGLSEYNAFWKCVQAGVTYLFVQLCKMLFLATF-FPTWEGGIYDFIG 69 



Db 



288 EFIGLCQSIAIFISLFLIKARK 309 

II; :| :|:| |:::|: 
70 EFMKASVDVADLIGLNLVMSRN 91 



RESULT 11 

ID 047977 PRELIMINARY; PRT; 379 AA. 

AC 047977; 

DT Ol-JUN-1998 (TREMBLREL. 06, CREATED) 

DT Ol-JUN-1998 (TREMBLREL. 06, LAST SEQUENCE UPDATE) 

DT Ol-JUN-1998 (TREMBLREL. 06, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B. 

GN CYTB. 

OS THOMOMYS BOTTAE (BOTTA'S POCKET GOPHER).' 

OG MITOCHONDRION, 



CC 
CC 
CC 
CC 
DR 
DR 
KW 

KW HEME, 

SQ SEQUENCE 379 AA; 43115 MW; 93B942A8 CRC32; 

Query Match 5,4%; Score 104; DB 8; Length 379; 

Best Local Similarity 26,2%; Pred. Ho. l.Ole+00; 

,Matches 16; Conservative 18; Mismatches 23; Indels 4; Gaps 3; 

Db 71 RDVNYGWLIRYMHANGASLFFICLYIHIGRXIY-YGSY-LYNETWNVGILLLFLTMATAF 128 

1':;: I :|:: : | : :. 1 1 : : | ||;: : :||;||:; || 
Qy 124 RGIEFDW--KYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFLSVYKAF 181 

Db 129 V 129 

I 

Qy 182 V 182 



EUKARYOTAE; MITOCHONDRIAL EUKARYOTES; METAZOA; CHORDATA; VERTEBRATA; 

MAMMALIA; EUTHERIA; RODENTIA; SCIUROGNATHI ; GEOMYIDAE; THOMOMYS 

[1] 

SEQUENCE FROM N,A, 
STRAIN-MVZ 156025; 
SMITH M,F,; 

MOL. PHYLOGENET, EVOL. 9:1-14(1998), 
-I- CATALYTIC ACTIVITY: QH(2) + 2 FERRICYTOCHROME C - Q + 2 

FERROCYTOCHROME C. 
-!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY 

BOUND TO THE PROTEIN (BY SIMILARITY). 
EMBL; D65267; G2894730; -. 
PROSITE; PS00192; CYTOCHROMEJJEME; 1. 

MITOCHONDRION; ELECTRON TRANSPORT; RESPIRATORY CHAIN; TRANSMEMBRANE; 



RESULT 12 




ID 


064515 PRELIMINARY; PRT; 527 AA. 


AC 


064515; 




DT 


Ol-AUG-1998 (TREMBLREL, 07, CREATED) 


DT 


Ol-AUG-1998 (TREMBLREL, 07, LAST SEQUENCE UPDATE) 


DT 


Ol-AUG-1998 (TREMBLREL, 07, LAST ANNOTATION UPDATE) 


DE 


YUP8H12R.2 PROTEIN, 




GN 


YUP8H12R.2, 




OS 


ARABIDOPSIS THALIANA (MOUSE-EAR CRESS). 


OC 


EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE; 


OC 


CAPPARALES; CRUCIFERAE, 


RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN-CV, COLUMBIA; 




RA 


THEOLOGIS A,, VYSOTSKAIA V,S., OSBORNE B,I,, SCHWARTZ J.R., 


RA 


FEDERSPIEL N.A., KWAN A., TORIUMI M., YU G,, OJI, 0, ARAUJO R., 


RA 


CHUNG E,, DEWER K., DIETRICfF,', ECKER J,rV,"mARZIALI A., OEFNER p',' 


RA 


DAVIS R.W.; 




RL'' 


SUBMITTED (OCT-1997) 


TO EMBL/GENBANK/DDBJ DATA BANKS. 


RN 


;[2] 




"RP' 


' SEQUENCE FROM N.A. 




RC 


STRAIN-CV, COLUMBIA; 




RA 


THEOLOGIS A,; 




RL 


SUBMITTED (SEP-1997) 


TO EMBL/GENBANK/DDBJ DATA BANKS. 


RN 


[3] 




RP 


SEQUENCE FROM N,A. 




RC 


STRAIN-CV. COLUMBIA; 




RA 


THEOLOGIS A,; 




RL 


SUBMITTED (OCT-1997) 


TO EMBL/GENBANK/DDBJ DATA BANKS. 


RN 


[4] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=CV, COLUMBIA; 




RA 


THEOLOGIS A,; 




RL'. 


SUBMITTED (OCT-1997) 


TO EMBL/GENBANK/DDBJ DATA BANKS. 'i ■ 


RN 


[5] ■■ 




RP( 


SEQUENCE FROM N,A.i' 




RC 


STRAIN-CV, COLUMBIA; 




RA 


THEOLOGIS; 




rl' 


SUBMITTED (MAY-1998) TO EMBL/GENBANK/DDBJ DATA BANKS, 


DR 


EMBL; AC002986; G3152576; -, 


SQ 


SEQUENCE 527 AA; 


57750 MW; 3B215C12 CRC32; ■ ' 



Query Match 



5,4%; Score 105; DB 10; Length 527; 



/) Best Available Copy 

, Sun Jan 24 11:29:04 1999 US-09-162"597-3.rspt Page 5 



Best Local Similarity 22.94; Pred. No. 7.72e-01; 

Matches 25; Conservative 28;. Mismatches 48; indels. .8;. Gaps-....8.;. 

Db 158 LTVFSPNIWVYAVLRFVNGFGRATIGTCALVLSIELVGKKWRGRVGIMSFFGFMLGFLSL 217 

I: I I I :; I: I :|:: |: : :|| ; |: | : || : 
Oy 53 LATFFPT-MEGGIYDFIGEFMRASVDVADLI-GLNLVMSRNAGK-GEYKIMVAALGWATA 109 

Db 218 PL-MA-YMN-RGSSWRILYAWISIPTIIYC-V-LVRFFVCESPRWLFVR 261 

I I: : : I ; I I I : : |;: I 

Qy 110 ELIMSRCIPLWVGARGIEFDWKYIQMSIDSNISLVHYIVASAOVWMITR 158 



RESULT 13 

ID Q18259 PRELIMINARY; PRT; 1410 AA, 

AC Q18259; 

DT Ol-NOV-1996 (TREMBLREL. 01, CREATED) 

DT Ol-NOV-1995 (TREMBLREL. 01, LAST SEOOENCE UPDATE) 

DT Ol-JDN-1998 (TREMBLREL, 06, LAST ANNOTATION UPDATE) 

DE SIMILARITY ALONG ENTIRE GENE TO CALCIUM CHANNEL ALPHA PROTEINS. 

GN C27F2,3. 

OS CAENORHABDITIS ELEGANS. 

OC EORARYOTA; METAZOA; ACOELOMATES; NEMATODA; SECERNENTEA; RHABDITIDA. 

m [l] 

RP SE(}UENCE FROM N.A. 

RX MEDLINE; 94150718. 

RA WILSON R., AINSCOUGH R., ANDERSON K,, BAYNES C, BERKS M., BONFIELD J,, 

RA BURTON J., CONNELL M,, COPSEY T., COOPER J,, COULSON A., CRAXTON M., 

RA DEAR S., DU Z., DURBIN R., FAVELLO A., FULTON L., GARDNER A., GREEN P,, 

•RA HAWKINS T., HILLIER L., JIER M,, JOHNSTON L., JONES M., KERSHAW J., 

RA KIRSIEN J., LAISTER N., LAIREILLE P,, LIGHTNING J., LLOYD C, 

RA MCMURRAY A,, MORTIMORE B., O'CALLAGHAN M., PARSONS J,, PERCY C, 

RA RIFKEN L., ROOPRA A., SAUNDERS D., SHOWNKEEN R., SMALDON N,, SMITH A., 

RA SONNHAMMER E., ST ADEN R., SULSTON J,, THIERRY -MIEG J,, THOMAS K., 

RA VAUDIN M., VAUGHAN K., WATERSTON R., WATSON A., WEINSTOCK L., 

RA WILKINSON-SPROAT J., WOHLDMAN P.; 

RL NATURE 368:32-38(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA KIRSTENJ.; 

RL SUBMITTED (NOV-1995) TO EMBL/GENBANK/DDBJ DATA BANKS. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA WATERSTON R,; 

RL SUBMITTED (NOV-1995) TO EMBL/GENBANK/DDBJ DATA BANKS. 

DR EMBL; U40419; G1065507; -. 

DR PFAM; PF00520; ion.trans. 

SQ SEQUENCE 1410 AA; 162922 MW; 57A8C35A CRC32; 

Query Match 5.4%; Score 104; DB 5; Length 1410; 

Best Local Similarity 24.7*; Pred. No. l.Ole+00; 
Matches 18; Conservative 22; Mismatches 27; Indels 6; Gaps 5; 

Db 913 IWMPNH-IEINSWAQLLMVCRAMRPLRVYALIP-HIRRVW-ELCRGFR-EILLVTILLV 968 

:|: : II:: I : I ; :| I : : ::: : :| : || :||: :| | 
Qy 119 LWVGARGIEFD-WKYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFLSV 177 I 

Db 969 VLMFIFASFGVQL 981 

I; :| I;| 
Qy 178 YKAFVMETF-VHL 189 



RESP 14 

ID 017615 PRELIMINARY; PRT; 308 AA, 

AC 017615; J 

DT Ol-JAN-1998 (TREMBLREL, 05, CREATED) 

DT Ol-JAN-1998 (TREMBLREL. 05, LAST SEQUENCE UPDATE) 

DT Ol-AUG-1998 (TREMBLREL, 07, LAST ANNOTATION UPDATE) 

DE C29F3.6. 

OS CAENORHABDITIS ELEGANS, 

OC EUKARYOTA; METAZOA; ACOELOMATES; NEMATODA; SECERNENTEA; RHABDITIDA. 

RN [1] 

RP SEQUENCE FROM N.A, 

RA MATTHEWS L.; 



RL SUBMITTED (OCT-1996) TO EMBL/GENBANK/DDBJ DATA BANKS. 

■EN...42J™ - - 

RP SEQUENCE FROM N.A, 

RA WILSON R., AINSCOUGH R,, ANDERSON K., BAYNES C, BERKS M., BONFIELD J., 

RA BURTON J., CONNELL M., COPSEY T,, COOPER J., COULSON A., CRAXTON M., 

RA DEAR S., DU Z,, DURBIN R., FAVELLO A., FULTON L., GARDNER A., GREEN P., 

RA HAWKINS T., HILLIER L., JIER M,, JOHNSTON L,, JONES M., KERSHAW J., 

RA KIRSTEN J., LAISTER N,, LATREILLE P., LIGHTNING J., LLOYD C, 

RA MCMURRAY A., MORTIMORE B,, O'CALLAGHAN M,, PARSONS J,, PERCY C, 

RA RIFKEN L., ROOPRA A., SAUNDERS D., SHOWNKEEN R,, SMALDON N., SMITH A., 

RA SONNHAMMER E., STADEN R,, SULSTON J,, THIERRY-MIEG J,, THOMAS K,, 

RA VAUDIN M,, VAUGHAN K., WATERSTON R., WATSON A,, WEINSTOCK L., 

RA WILKINSON-SPROAT J., WOHLDMAN P.; 

RL NATURE 358;32-38(1994). 

DR EMBL; Z81043; E347909; 

DR PFAM;.PF00001; 7tra_l. 

SQ SEQUENCE 308 AA; 35499 MW; BB5A2150 CRC32; 

„ Query Match 5.3%; Score 103; DBS; Length 308; 

BesrLocaT'ilm~iTarity"''25,5%;" PreOo! i.32e+00; 
Matches 38; Conservative 40; Mismatches 59; Indels 12; Gaps 11; 

Db 61 YLFY-FCFMDFLDVFLFQE-NVFIFGFLVAVFHDVSVLTHFIISLNRFISVWCPIF-YKT 117 

III :l I II II : I: I I II :|:|| :l II 
Qy 42 YLFVQLCKMLFLATF-FPTWEGGIYDF-IGEFMKASVDVADLIGLNLVMSRNAGKGEYKI 99 

Db 118 MFN-LKYTKLFIFAVWLISFLIGS--L-FHIVLCRIRFNADLILFTIILAPSyCFEIGRY 173 

I I : ;; : I:; :|: : I :: :::;; | |:|;; j || 
Qy 100 MVAALGWATAELIMSRCIPLWYGARGIEFDWKYIQMSIDSNISLVHYIVASAQVWMITRY 159 

Db 174 GDLFRN-SCIVIIMMILDVF-TLIKVRLI 200 

||::| |;::|:| j: :::; :: 
Qy 160 -DLYHNFRPAVLLLMFLSVYKAFVMETFV 187 



RESULT 15 

ID 047963 PRELIMINARY; PRT; 379 AA. 

AC 047963; 

DT Ol-JUN-1998 (TREMBLREL, 06, CREATED) 

DT Ol-JUN-1998 (TREMBLREL, 06, LAST SEQUENCE UPDATE) 

DT Ol-JUN-1998 (TREMBLREL. 06, LAST ANNOTATION UPDATE) 

DE CYTOCHROME B. 

GN CYTB, 

OS THOMOMYS BOTTAE (BOTTA'S POCKET GOPHER). 

OG MITOCHONDRION, 

OC EUKARYOTAE; MITOCHONDRIAL EUKARYOTES; METAZOA; CHORDATA; VERIEBRATA; 

OC MAMMALIA; EUTHERIA; RODENTIA; SCIUROGNATHI; GEOMYIDAE; THOMOMYS, 

RN [1] 

RP SEQUENCE FROM N,A, 

RC STRAIN-MVZ 166821; ■ 

RA SMITH M.F.; 

RL MOL. PHYLOGENET. EVOL. 9:1-14(1998). 

CC -1- CATALYTIC ACTIVITY; QH(2) + 2 FERRICYTOCHROME C = Q + 2 

CC FERROCYTOCHROME C. 

■ 1 "i CC -1- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY' 

CO. BOUND TO THE PROTEIN (BY SIMILARITY), 

DR EMBL; U65253; G2894702; -, 

DR PROSITE; PS00192; CYTOCHROMEJJEME; 1, 

KW MITOCHONDRION; ELECTRON TRANSPORT; RESPIRATORY CHAIN; TRANSMEMBRANE; 

KW HEME. 

■ ' SQ SEQUENCE 379 AA; 43114 MW; 9E37B5B0 CRC32; 

Query Match 5.3%; Score 102; DB 8; Length 379; 

Best Local Similarity 27,9%; Pred. No. 1.73e+00; 

Matches 17; Conservative 16; Mismatches 24; Indels 4; Gaps 3; 

Db 71 RDVNFGWLIRYMHANGASLFFICLYIHIGRGIY-YGSY-LYKETWNVGILLLFLTMATAF 128 

I ::| I :|:: : I ; ; || : : III : ; :||:||;: || 
Qy 124 RGIEFDW-KYIQMSIDSNISLVHYIVASAQVWMITRYDLYHNFRPAVLLLMFLSVYKAF 181 

Db 129 V 129 

■i I 
Qy : 182 V 182 



Best Available Copy 

Sun Jan 24 11:29:04 1999 



Search completed: Fri Jan 22 18:02:50 1999 
Job time : 94 sees, 
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